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1. — Introduction. 


The differential cross-section for the scattering at small angles of 970 MeV 
protons from Carbon has been measured in order to examine the interference 
between the Coulomb and nuclear parts of the scattering at this energy. The 
measurements have been made at laboratory angles from 1° to 335 


2. — Experimental arrangement. 


The proton beam of the Birmingham synchrotron was scattered from a 
carbon target placed inside the vacuum box. Protons elastically scattered 
from this target through 54°, passed through a thin window, and were focussed 
to a nearly parallel beam by a set of 
steel focussing shims [1] attached to 
the synchrotron magnet. This beam CORTE 
then passed through a hole in the WALL 
concrete shielding wall into the ex- 
perimental area. The vertical height SCATTERER 5 
of the beam was reduced by use of a IA 
tungsten alloy channel; a carefully 
positioned iron channel then allowed 
the main beam to pass through it 
but removed unwanted background. 
(See Fig. 1). 

At the experimental position the beam was 4 cm high and had a horizontal 
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Fig. 1. — Experimental arrangement 
(Not to scale). 


(*) Lavoro presentato al XLIII Congresso Nazionale della Societa Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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distribution which was close to that of a gaussian with standard deviation | 
1em. The total beam intensity was ~ 104 particles/pulse. The scattering target 
for the experiment was placed at a point 2 metres from the exit of the iron channel 
and particles scattered by the target were observed in photographic emulsions 


placed 1 metre away from it. 


The unscattered beam also pasesd through the 


centre of the plates, giving an angle to which the scatterings could be referred. 


3. — Observations. 


At any point on one of the plates tracks of protons scattered from the 
carbon block, together with a usually much larger number emerging from the 
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Fig. 2. — Target and plate geometry. 


Measurements of angles to the nearest }° 
unscattered beam in the centre of the plates. 
tered tracks from those in the larger peak 
proved possible at angles of scatter down 
to 1°. Reference to Fig. 2, shows that par- 
ticles scattered from the target, at a given 
angle, should appear only in a certain region 
on each plate, of width equal to the beam 
width. At the smallest angles this region 
merged with the main beam in the plates, 


Fig. 3. — Histograms of the number of tracks as 

a function of angle for a certain plate area, 

with and without the second carbon target 
in position. 


iron channel, can be found. 
Fig. 2 shows the geometrical 
arrangement and the way 
in which these two groups of 
tracks can be separated by 
measurements of the angles. 
Fig. 3 shows histograms of 
the number of tracks ob- 
served as a function of angle 
for typical parts of a plate, 
with and without the second 
arbon target in position. 
The subsidiary group of 
tracks due to the particles 
scattered by the target is 
clearly seen. 

were made with respect to the 

A clear separation of the scat- 


40 


No 
Target 


20+ 


N2 of Tracks 


10 


Degrees 


I 
— \ 


SMALL ANGLE SCATTERING OF 790 MeV PROTONS FROM CARBON 3 


and observations were not possible over the whole width. A geometrical cor- 
rection was therefore necessary, which was rather large for the points at 1° 
and 14° but negligible at larger angles. 


4. — Results and discussion. 


The values obtained for the differential cross-section are shown in Fig. 4. 
The errors shown are based on the standard deviations of the numbers of tracks 
counted. In addition to the geometrical correction, a correction was also 
applied to the small angle points to allow for 
multiple and plural Coulomb scattering. This 50 
correction was appreciable only at angles less 
than 14°. 

The beam monitoring was inadequate to give 
absolute cross-sections to better than +50%. 
Therefore, the results have been normalized to 
the theoretical curve shown in Fig. 4, which was 
obtained in the following way. The nuclear part 
of the scattering was calculated on the assumption 
that the target nucleus appears as a black dise of 
radius of 2.86-:10-!3 em, and it was normalized to 
a total elastic cross-section of 180 mb as determined 
by counter experiments at this energy. The Cou- 
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5 
two were combined using the formula 
5 Pig. 4 — Differential 
do(0) | scattering cross-section of 
Tor | Ro(0) |? +110) + L(8) |? ; 970 MeV protons from 
carbon at angles between 
where À, and J, are the real and imaginary parts 1° and 37°; the full line is a 


of the Coulomb scattering amplitude and J, is the theoretical curve (see text). 


purely imaginary nuclear scattering amplitude. 

Within the limits of experimental error it is clear that a good fit is obtained 
without invoking any real part to the nuclear scattering. However, owing 
to the existence of the term J,(0) some interference does occur and makes 
the present results rather insensitive to a small real nuclear potential. The 
experiment would be more sensitive if the absolute values of the differential 
scattering cross-section were known and experiments to obtain this information 
are now in progress. 
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Sono ormai ben noti quei centri di colore caratteristici — centri Z, e Z, — 
che si formano con i normali metodi di colorazione nei cristalli di alogenuri 
alcalini contenenti impurezze bivalenti positive. Vari autori hanno contri- 
buito allo studio delle principali proprietà di questi centri [1-5], e le ricerche 
condotte fin qui hanno permesso di vagliare alcuni modelli sulla natura dei 
centri stessi. In Fig. 1 sono illustrati i modelli di Pick 
e di Seitz. 

In recenti lavori svolti dal nostro gruppo in questo 
campo, l’interesse era già stato portato alle trasforma- 
zioni fra centri F e centri Z, provocate mediante ecci- 
tazione ottica o termica. Si rimanda a quanto è stato 
pubblicato sull’argomento per maggiori dettagli [4, 5]. 
Oggetto della presente relazione sono alcune misure 
sulle conversioni termiche fra centri F e centri Z,, à 


a) Z, centro (Pick) 


ab ocaten (Piet) varie temperature. Fer studiare la reazione F = Z,, si 
c) Z, centro (Seitz) determinano le variazioni (provocate da annealing) nel- 
d) Z, centro (Seitz) À 


l’altezza delle bande di assorbimento # e Z,, come 
mezzo per risalire alle variazioni di concentrazione dei 
centri di colore interessati. 

Le misure di assorbimento sono state eseguite per mezzo di uno spettro- 
fotometro convenzionale già descritto [4], composto di: sorgente luminosa, 
monocromatore, cella di assorbimento, rivelatore a cellula fotoelettrica, La 
cella porta-cristallo fa parte di un criostato che permette di fare le osserva- 
zioni alla temperatura dell’aria liquida. Inoltre la disposizione sperimentale 
permette di effettuare sul posto, negli intervalli fra le misure, il riscaldamento 
controllato del cristallo colorato. 

Si sono adoperati per le misure cristalli di KCl, cresciuti dal fuso con pic- 


Fig. 1. 


(*) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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cole addizioni di SrOl,: il tenore finale di Sr++ era dell’ordine di 10-* molare. 
La colorazione per eccesso stechiometrico veniva effettuata mediante cottura in 


i vapori di Na a — 570 °C. Per uniformare 
campioni, e anche per rendere possibile 
la creazione di grosse bande Z,, i cristalli 
venivano raffreddati lentamente fino a 
~ 220 °C, ivi tenuti per (15-20) minuti, 
e infine portati a temperatura ambiente 
con un quenching veloce. 


Trasformazione termica I’ — Z,. 


Scaldando cristalli colorati con banda 
F (accompagnata o meno da banda Z, pri- 
maria) a temperature fra 80 °C e 110 °C, 
si è notata una distruzione della F con for- 
mazione e/o crescita della Z,. Questa con- 
versione avviene direttamente al buio. 

Le osservazioni sono state effettuate 
riscaldando i cristalli a temperature con- 
trollate per tempi diversi. La conversione 
è stata così osservata a 110 °C, 100 °C, 
90 °C. La sua velocità è ancora apprezza- 
bile a 80°C. Un esempio delle variazioni 
provocate da annealing nello spettro di 
assorbimento è dato in Fig. 2). Da mi- 
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Fig. 2. — Effetto del riscaldamento 
a 100 °C sullo spettro di assorbimento 
(bande Fe Z,). Osservazioni eseguite 
a temperatura dell’aria liquida. © ini- 
ziale; O dopo 44 minuti a 100 °C?e 


dopo 225 minuti a 100 °C. 


sure sulla variazione temporale del numero di centri (a 100 °C) risulta 
che le concentrazioni di F e Z, tendono, per lunghi tempi di cottura, a 
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tempo di annealing 


un valore di equilibrio 
(v. esempio di Fig. 3). 
Questi risultati comple- 
tano osserva- 
zioni precedenti, sulla tra- 
sformazione F +> Z, a 
110 °C [5]. 


alcune 


Fig. 3. — Variazione tempo- 

rale del numero di centri F 

e Z, (per cm?) in seguito ad 
«annealing » a 100 °C. 
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Trasformazione termica Z, — F. 


Come si vede nell’esempio di Fig. 4, esisteanche la trasformazione inversa: 
una grossa banda Z, secondaria, creata dalla banda F per conversione ter- 
mica a — 105 °C, regredisce quando si scalda il cristallo ad una temperatura 
suepriore (7— 158 °C); contemporaneamente si riforma la F. Anche in questo 
caso, la banda F sembra raggiungere un valore di saturazione; non è chiaro, 
invece, se la banda Z, raggiunga l’equilibrio, o continui lentamente a diminuire. 

Allo stesso modo si comporta la banda Z, secondaria di Fig. 5, ottenuta 
per distruzione della Z, ad una certa temperatura (7= 96°C): continuando 
il riscaldamento a quella temperatura, la Z, creata «artificialmente » dimi- 
nuisce, e si riforma la F. 
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Fig. 4. — Effetto del riscaldamento a 158°C sulle bande F e Z,. (Osservazioni effet- 

tuate alla temperatura dell’aria liquida). @ iniziale (banda Z, creata per conversione 

dalla F a 105 °C); e dopo 20 min a 158 °C; © dopo 40 min a 158 °C; Xx dopo 
60 min a 158 °C; m dopo 80min a 158 °C. 


I fatti esposti si possono interpretare in termini di una ben definita « rea- 
zione » fra centri F e centri Z,, analoga ad una reazione chimica. Con il mo- 
dello di Seitz (v. Fig. 1) che è quello generalmente accettato, la conversione 
F — 7, si può spiegare nel seguente modo: 


2 +44 ++ 
= 


Sa Fl — | 
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Vale a dire, i complessi neutri (Srt+-vacanza di ione positivo) migrano verso 
i centri F per effetto della agitazione termica, e coagulano con essi a formare Z,. 
D'altra parte, il processo è dinamico, ed esiste anche la reazione inversa per 
la quale i centri Z, si dissociano. 
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Fig. 5. — © iniziale (banda F e banda Z,); e dopo 16 min a 96 °C (banda / e Z,); 

@ dopo 32 min a 96 °C (banda Fe Zs); @ dopo 107 min a 96 °C (banda F e Z,)- 

x dopo 190 min a 96 °C (banda F e Z;). (Osservazioni effettuate a temperatura del; 
l’aria liquida. 


Se quello presentato è il meccanismo giusto, si può applicare alla reazione 
globale la legge dell’azione di massa (usando, per i centri di colore nel cristallo, 
l’analogia con le soluzioni diluite): 


(1) 


dove K è una costante di reazione dipendente dalla sola temperatura. In 
questo modo si spiegherebbe il verso della reazione nei diversi casi, in dipen- 
denza dai valori relativi iniziali delle concentrazioni. 

Dalla (1) si può ricavare direttamente il calore di attivazione della reazione 
F ZZ,: introducendo in essa i valori sperimentali del rapporto Ca, als 
l'equilibrio, relativi a due diverse temperature T, e T,, si trova il valore del 
rapporto K(7,)/K(T.)= exp [(0/2)((1/T,) — (1/T,))], e quindi si ricava il calore 
globale di attivazione, Q. La cosa è possibile poichè le Cos incognite si eli- 


minano: infatti la concentrazione dei complessi neutri (Srt*-vacanza) è piut- 
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tosto insensibile a variazioni anche notevoli di temperatura, come si deduce 
da una conoscenza teorica dell’energia di dissociazione [6] che è circa 0.4 eV. 
La CH si può allora trattare come una costante su piccoli intervalli di tem- 
peratura. Il calore di attivazione così stimato, utilizzandoi valori all’equilibrio 
delle C per 100 °C e 158 °C, è Q — 0.32 eV/centro, che sembra un valore ra- 
gionevole. Come è noto, Y rappresenta la differenza fra i calori di attivazione 
delle reazioni componenti. 

Le precedenti considerazioni sembrano far ritenere che il modello di Seitz 
è plausibile. 

Una estensione delle presenti ricerche sull’annealing dei centri Z, e Z, è 
in programma, e si spera che possa dare qualche ulteriore chiarimento. 
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1. — Introduccion. 


El estudio de la reaccién ’Li(d, n)*Be, mediante la técnica fotografica, ha 
sido objeto de una serie de trabajos anteriores [1-3] en los cuales determinamos 
por primera vez, algunos de los niveles energéticos del Be. Nuestros resultados, 
que coincidieron con los obtenidos independendientemente por otros Centros [4, 5] 
fueron discutidos y puestos en duda en la Conferencia de Glasgow en 1954 [6]; 
de ello surgié el proyecto de una nueva experiencia para tratar de confirmar 
la existencia de estos nuevos posibles niveles del *Be [7]. 

Esta experiencia fué planeada juntamente con el Dr. Gipson de la Uni- 
versidad de Belfast, y el Grupo de Fisica Nuclear de Aldermaston, siendo 
realizada en dicho Centro a principios del afio 1955. Los neutrones produ- 
cidos al bombardear un blanco de ‘Li, isotépicamente puro, con deuterones 
de 1080 keV, acelerados en el Van der Graaf de Aldermaston, fueron recogidos 
en un lote de 12 placas fotonucleares Ilford C-2, lo que nos permitié realizar 
un estudio de la distribucién angular de los neutrones producidos en dicha 
reaccién correspondientes a los distintos estados excitados del *Be. 

Las placas impresionadas, una vez reveladas, fueron examinadas en los 
Laboratorios de la Seccién de Valencia del Instituto de Optica « Daza de 


Valdés ». 


(*) Este trabajo ha sido subvencionado por la Junta de Energia Nuclear Espafiola. 
(+) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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2. — Proceso experimental. 


La camara de exposicién y los datos experimentales con que fué realizada 
han sido descritos en [7]. Las doce placas estaban situadas alrededor del blanco 
a intervalos angulares de 15° y a una distancia de 16 cm del mismo. El examen 
de las placas se realizé mediante dos microscopios C.T.S. M4000, estudiando 

un promedio de 1500 trazas 


agg de protones de retroceso en 


Lee cada una de ellas, segün el 
d i Trazas medidas 


método descrito por POWELL [8] 
y RICHARDS [9]. 

Para el calculo de la energia 
de los neutrones, utilizamos un 
sistema original de nomogra- 
mas, cuya descripcién puede 
verse en [10], sin que fuera 
afectado por ello la precisién 
de la medidas. 

Para obtener la distribucién 
espectral correcta del nimero 
de neutrones, fué necesario 
multiplicar el nimero de trazas 

- observadas correspondientes a 
cada energia, por un factor inversamente proporcional a la seccién eficaz de 
difusién (n-p) [11] y a la fraccién de dichas trazas que no escaparon del seno 
de la emulsién [12]. Con ello se ha obtenido un histograma para cada Angulo 
como el representado en la Fig. 1 correspondiente al angulo de 135°, en donde 
pueden verse grupos homogéneos correspondientes al estado fundamental ya 
los siguientes estados de excitacién del “Be : 1.37, 2.09, 2 81, 3.52, 4.20, 5 01, 
5.91 y 6.68 MeV. 


Nümero de trazas 


3. — Secciones eficaces elementales. 


El calculo de las secciones eficaces elementales correspondientes al estado 
fundamental y a cada uno de los niveles energéticos citados, se ha realizado, 
siguiendo el método descrito en [13] a partir de las condiciones geométricas 
de la exposicién, angulo y distancia al blanco de la zona de emulsién estudiada 
y numero total de neutrones correspondientes a cada maximo. 


on 


Late. 
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Los resultados, referidos sof Estado fundamental 
en coordenadas C.M., vienen 
representados graficamente 
en la Fig. 2. 

Se han incluido en una Vos 300 60° 90° 120° 190° 
sola distribucién angular, los 
neutrones correspondientes 
al- -triplete 2.09, 2.81, 
3.52 MeV, pues, como puede 
verse en la Fig.1, los mà- 
ximos correspondientes se 
solapan en sus bases de tal 
modo que es imposible el 
computo de los neutrones 
pertenecientes a cada uno 
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4. — Secciones eficaces totales. 30 E*= 5'91 MeV 


La seccion eficaz total 
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correspondiente a cada uno 
de los niveles de excitacién 
citados, ha sido calculada a 
partirde la distribucién angular y ha dado los resultados expresados en la Tabla I. 

Como resultado de este trabajo, confirmamos la presencia de los niveles 
del “Be claramente establecidos con anterioridad en 2.8, 4.2 y 5.0 MeV. Asi- 


Fig. 2. 


Tapia I. — Secciones eficaces totales de la reaccién "Li(d, n)$Be correspondientes a los 
distintos niveles de excitacibn del ‘Be. 


Nivel Secciòn eficaz total | 
| 
Estado fundamental | 0.45 + 0.06 mb | 
1.37 MeV | 0.16 + 0.05. » | 
| 2.09 » : | = | 
| 2.82 » 1.56 + 0.14 » | 
| 3702 => — | 
4.20 » 0.23 £0.05 |» | 
50100) 0.23 + 0.05 » | 

| 5.91 » 0.16 + 0.04 » 
| 6.68: » 0.14 + 0.04 | 
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_mismo acusamos nuevamente la presencia de los niveles 2.09 y 3.52 MeV men- 

cionados por primera vez en un trabajo anterior [1] y que fueron citados 
también por otros investigadores [4, 5]. Analogamente señalamos la presencia 
de un nivel a 5.91 MeV citado con anterioridad por GowArD et al. [14], y de 
6.68 MeV mencionado por ERDOS et al. [5]. 


[1] 
[2] 


[3] 
[4] 
[5] 
[6] 


[7] 
[8] 
[9] 
[10] 
[11] 
[12] 
[13] 


[14] 
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Strumento per la misura della contaminazione dell’aria 
dovuta a pulviscolo contenente uranio, 
insensibile alla radioattivita dell’aria (*). 


C. CoTTINI, E. GATTI and A. MALVICINI 
Laboratori CISE - Milano 


Il monitore che descriviamo è stato studiato per il reparto di Metallurgia 
dell’uranio del CISE e serve a controllare durante la fase di lavorazione la 
contaminazione dell’aria dovuta a pulviscolo contenente uranio. Lo strumento 
è a funzionamento continuo ed è sensibile ad una concentrazione inferiore alla 
massima permissibile per esposizione professionale. 

Come è noto la maggiore difficoltà nel rivelare tracce di uranio in aria è data 
dalla presenza di emanazione in quantità molto variabile con le condizioni 
metereologiche e la ventilazione dell'ambiente. L’attività « dei prodotti di 
decadimento dell'emanazione è in genere maggiore rispetto a quella della 
concentrazione massima permissibile di uranio e pertanto maschera l’attività « 
dell’uranio. 

La discriminazione fra le due attività viene fatta seguendo il decadimento 
del deposito attivo raccolto su carta da filtro mediante aspirazione, oppure 
aspettando che l’attività del RaC’ sia completamente decaduta (ossia qualche 
ora). Un altro metodo usato per discriminare un emettitore « a vita lunga dai 
prodotti dell'emanazione, senza dover aspettare che la loro attività sia deca- 
duta, consiste nella misura energetica delle particelle x emesse dal deposito 
attivo raccolto mediante precipitazione elettrostatica su un nastro trasporta- 
tore di alluminio. La difficoltà di questo procedimento consiste nel disporre 
di un rivelatore che dia impulsi proporzionali all’energia delle particelle «. 
La camera di ionizzazione a impulsi, che fornisce un’ottima risoluzione in 
energia, non si presta a funzionare con continuità. 

Come rivelatore è stato talvolta usato il cristallo di CsI* sotto forma di 
lamina sottile che dà una buona risoluzione (circa 10%) ed è pressochè insen- 
sibile all'umidità dell’aria. 


(*) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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Con la discriminazione energetica delle particelle x, oltre al discriminatore 
ad uno o due canali, è necessaria una buona stabilizzazione della tensione di 
alimentazione del fotomoltiplicatore. Un’altra difficoltà non trascurabiie con- 
siste nel disporre di un rivelatore di CsI* avente una superficie conveniente- 
mente estesa. 

Il monitore è rappresentato schematicamente nella Fig. i. Il deposito attivo 
viene raccolto su un nastro di alluminio che passa con continuità attraverso 
un precipitatore elettrostatico e fra due scintillatori di ZnS*. 


ARE Generatore 


precipitatore |elettrostatico 10kV 


imp 
neg. 


rullo del nastro 
di alluminio 


aspiratore 


imp. 
pos. 


Discriminatore 


Fig. 1. — Schema di funzionamento del monitore. 


Il nastro di alluminio ha uno spessore di circa 4 mg/em:, tale cioè da assor- 
bire quasi totalmente le particelle x dell’uranio e circa il 40% delle particelle x 
del RaC’ (che rappresenta quasi tutta l’attività dovuta ai prodotti di deca- 
dimento del radon). 

Il rivelatore che esplora direttamente la faccia con il deposito ha un dia- 
metro di 8 cm, l’altro ha un diametro di i2 em. In queste condizioni i due 
contatori danno un numero uguale di impulsi (fluttuazioni statistiche a parte) 
in presenza di sola contaminazione dovuta ad emanazione. Se il deposito 
raccolto contiene anche attività x dovuta ad uranio questa verrà rivelata solo 
da uno dei due contatori. Gli impulsi dei due contatori vengono inviati ad un 
frequenzimetro differenziale sensibile solo alla differenza della frequenza di 
impulsi dei due canali di misura, che segnerà solo l’attività x dell’uranio. 

Con un flusso di 500 litri di aria al minuto attraverso al precipitatore ed 


una velocità del nastro di 0.5 cm/min il deposito esplorato dal contatore desti- 


nato a misurare l’attività dell'uranio è quello contenuto in 5.5 m° di aria ; 
l’attività « relativa ad una concentrazione massima permissibile per esposi- 
zione professionale è di circa 500 imp/min. L'attività x dovuta al RaC' rive- 
lata dai due contatori è dell’ordine del migliaio in un ambiente moderatamente 
ventilato. 


Con un periodo di integrazione del frequenzimetro di 40 secondi, le flut- 
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tuazioni statistiche dovute all’attività del RaC’ sono inferiori a 1/5 del valore 
relativo ad una dose permissibile di uranio. 

La Fig. 2 mostra lo schema di funzionamento del frequenzimetro a tran- 
sistori: la tensione è fornita da una pila di 6 volt. 

Il frequenzimetro è 
costituito da due discri- RS ELI 
minatori ad oscillatore 
bloccato, direttamente 
alimentati dai fototubi, 
e da un voltmetro. 

I due discriminatori 
forniscono impulsi di 
area costante uguaie per 
entrambi, perciò le ten- 
sioni medie di uscita à 
sono proporzionali alla 
frequenza degli impulsi 
e la costante di propor- ? 
zionalita è uguale per i | 

| 
| 


IE 


C45 2502 
SN 


x 
022 
WN 


Transitori OC 44 | 1 


Diodi al germanio OA5 I 


due discriminatori. 

Il voltmetro è con- 
nesso in modo da misu- Fig. 2. — Schema elettrico del frequenzimetro. 
rare la media della diffe- 
renza fra le tensioni di uscita e quindi dà direttamente la differenza fra le fre- 
quenze di impulsi nei due canali. 

I discriminatori ad oscillatore bloccato sono del tipo a reazione interrotta 
da un diodo: Il transistore è sempre in conduzione con corrente di emettitore 
stabilizzata. Il diodo che interrompe la reazione è in serie al circuito di base 
ed è mantenuto interdetto da una tensione di blocco prefissabile. 

La corrente di uscita del fototubo è inviata alla base del transistore, viene 

amplificata da questo e determina una tensione indotta ai capi dell’avvolgi- 
mento di base che tende a neutralizzare la tensione di blocco del diodo; quando 
questo diventa conduttore il circuito di reazione si chiude e viene generato 
un impulso che termina per saturazione del ferro del trasformatore. 
“Il circuito richiede due compensazioni. La prima serve a mantenere indi- 
pendente dalla frequenza degli impulsi la corrente di riposo dell’emettitore, 
ed è data da un transistore ausiliario che fornisce all’emettitore la corrente 
durante gli impulsi stessi. 

La seconda ha lo scopo di eliminare le variazioni con la temperatura della 
soglia di scatto, imputabili alle variazioni della caratteristica del diodo di blocco 
della reazione e della tensione di entrata del transistore che corrisponde al 
valore di corrente nel circuito di reazione per cui il diseriminatore scatta. La 
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compensazione è ottenuta aggiungendo alla tensione di soglia le tensioni ai 
capi di due diodi in serie tenuti in conduzione a corrente costante. “A 

Gli impulsi di uscita degli oscillatori bloccati durano 150 us. Il discrimi- 
natore deve essere comandato da un generatore ad alta impedenza e quindi 
è comandato da corrente. La sua sensibilità riportata in tensione è pari ad un 
impulso di 50 mV ai capi di un condensatore di 20 pF. 
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—_ + 
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Fig. 3. — Attività del deposito raccolto in funzione della distanza dall’ingresso dell’aria. 


Pure l’alta tensione dei fotomoltiplicatori è a transistori. Il precipitatore 
elettrostatico è formato da 50 punte distanti circa 1 em dal nastro di alluminio 
tenute alla tensione di circa 10 kV da un alimentatore a transistori. 

L'efficienza di raccolta del precipitatore elettrostatico si è dimostrata abba- 
stanza buona. 

Nella Fig. 3 è rappresentato il grafico dell’attività dovuta al RaC’ del depo- 
sito raccolto in funzione della distanza dall’ingresso nel precipitatore per una 
velocità dell’aria di 3 m/s. Come si vede la maggior parte del deposito è rac- 
colto nei primi centimetri del precipitatore. 

Come aspiratore è stata usata nelle prove preliminari una pompa rotativa 
che verrà sostituita da una ventolina, alimentata a pile. Questo è possibile 
in quanto Varia attraverso il precipitatore non incontra apprezzabile resistenza. 


SUPPLEMENTO AL VOLUME IX, SERIE X Nee OOS 
DEL NUOVO CIMENTO 3° Trimestre 


Examples of Direct Emission of f-rays after Capture 
of a y-Meson by an Argon Nucleus (*). 


R. A. DONALD and G. R. EVANS 
Department of Natural Philosophy, The University of Edinburgh - Edinburgh 


Among photographs of cosmic rays obtained with a Wilson cloud chamber 
filled with 75 atmospheres of argon, three examples have been found of 
particles, identified as u-mesons, stopping in the chamber and subsequently 
emitting f-rays of energy approximately 2.8 MeV (see Fig. 1, 2, 3). In two 
of these cases the electron stops in the chamber. From the number of iden- 
tified u-e decays, the positive excess of u-mesons, and the capture probability 
for negative u-mesons in argon, it is calculated that approximately 35 stopped 
u-mesons are contained in the photographs in question. It is most unlikely 
that the events can be normal u-e decays since very few decay electrons fall 
in the energy band (0-5) MeV. The Michel spectrum predicts that, on average, 
less than 0.03 such decays would be observed for the conditions of our ex- 
periment. It is also impossible that they could be examples of Auger electrons 
consequent on capture of the meson into an extra-nuclear orbit, since the 
energy of the 1S state of a u-mesonic argon atom is only 1 MeV. 


1. — Identification of the stopping particles. 


Event 4418 (see Fig. 1). The whole sequence is r-u-e. The appearance 
is completely typical of z-u-e decays in our chamber (see Fig. 4 for a normal 
r-u-e decay). In event 4418, the z-meson must have decayed in flight, since 
the u-meson has a range of 1.3 cm as opposed to the normal range of 1.6 cm. 
In order to satisfy the dynamics of the event, the velocity of the z-meson at 
the point of decay must have been /= 0.25. The velocity of the u-meson 
with 1.3 cm residual range is also 0.25. The appearances of the tracks are quite 
consistent with these values. 


(*) Lavoro presentato al XLIII Congresso Nazionale della Societa Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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Event 4122 (see Fig. 2). It is obvious from the appearance of the event 
that there is a decay process involved. The minimum ionization track is an 
electron, from the observed scattering. The mass of the parent, determined 
by multiple scattering vs. residual range measurements, is 40 +% m,. Since 


no examples of x-e decays have been found, it is identified as a u-meson. 


Event A 1414 (see Fig. 3). In this case a magnetic field of 3300 gauss 
was operated with the chamber and the sign of the stopping particle and the 
electron is known to be negative. Curvature measurements against residual 
range (assuming the decay process takes place at the point where there is a 
sharp change of curvature) give a mass value of (150+100)m,. Measure- 
ments of track thicknesses are consistent with the assumption of a pu-meson 
coming to rest. The presence of a 3-ray 7.6 mm from the end of the track 
indicates that the mass cannot be greater than 250 m,. 


TABLE I. — Energy of the secondary particle. 


Event | Method of Measurement | Energy 
| J A : ca A Me ” 
4418 | Range — 11.3 em at 76 atmospheres 2.6 + 0.1 MeV 
4122 3 Multiple scattering | 3.9 + 0.9 MeV 
G A 1414 Range in chamber=12.3 cm at 80 atmospheres 2.9 + 0.1 MeV 


2. — Interpretation of the events. 


Each event is consistent with the u-meson being negatively charged. It 
is therefore possible that the electron is emitted by the residual chlorine nucleus 
after capture of the u-meson. In order that we could observe the @-particle, 
it would be necessary for it to be emitted within 200 milliseconds of the capture, 
and from the appearance of the electron tracks it is likely that they were emitted 
within 100 milliseconds. No recorded isotope of chlorine has a short enough 
half-life to allow the observation of the events. The shortest known half-life 
is that of “Cl, reported as 1.4 minutes, by MORINAGA [1] with a maximum 
energy of 7.5 MeV. However “Cl has a longer half-life than might be expected 
since its ground state has spin 2, while the ground state of argon is spin zero. 
The transition is therefore forbidden, and it is possible that an isomeric state 
of “Cl might exist with spin zero. Such a state has not been reported, but 
would probably not have been detected in any case. Such an isomeric state 
would be expected to have a much shorter half-life (between 0.15-and 15 s). 

The possible consistency of the electron energies with a line spectrum raises 
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the possibility of them being internal conversion electrons. They cannot be 
interpreted in this way since the process could not compete with direct pair 
production to the ground state of Cl. 


3. — Conclusion. 


It is concluded that the events arise from the formation of a previously 
unknown isomeric state of ‘01. 

It is to be noticed, that if the isomeric explanation is correct it may be 
necessary to take this phenomenon into account in the calculations of neutron 
multiplicity after capture of a negative u-meson by a nucleus. 

It is intended to verify the existence of the supposed isomeric state by 
bombarding our cloud chamber, filled with argon, with neutrons from a Cock- 
croft-Walton accelerator in this department. The n-p reaction will create *Cl, 
and if the isomeric state exists, electron tracks, associated with the ends of 
the recoil protons, should be visible. It is hoped in this way to find the correct 
interpretation of these three events. 
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Impiego della dipendenza della legge di decadimento 
di scintillatori organici dalla densità di eccitazione 
per discriminare neutroni veloci dal fondo 7 (*). 


M. FORTE (+) 
Laboratori CISE - Milano 


1. — Introduzione. 


È noto che le scintillazioni prodotte in materiali organici da particelle 
ionizzanti hanno caratteristiche che possono dipendere in larga misura dalla 
natura e dall’energia della particella. 

Di notevole interesse pratico sono gli effetti della densità di ionizzazione 
ed eccitazione sulla efficienza di scintillazione e sull’andamento dell’intensità 
di emissione in funzione del tempo. Tali effetti sono tra loro strettamente 
collegati. 

Di solito ad una maggiore ionizzazione specifica corrisponde una efficienza 
di scintillazione minore. Ad esempio, le particelle x hanno una efficienza del- 
Vordine di 1/10 relativamente a £ energici; nella maggior parte degli scintil- 
latori organici. 

In generale la scintillazione incrementale dN/dE (N numero di fotoni) e 
la scintillazione per unità di percorso dN/dx, che sono funzioni dell’energia Æ 
della particella considerata, si possono esprimere mediante funzioni uniche di 
dE/dr, come è stato mostrato da TAYLOR ed altri [1]. 

Per interpretare la dipendenza dell’efficienza di scintillazione dalla ioniz- 
zazione specifica, occorre considerare gli effetti di quenching della fluorescenza 
nella scia della particella dovuti a vari processi, che è opportuno distinguere 
in unimolecolari e bimolecolari. 


(*) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
(+) Attualmente del C.N.R.N. 


IMPIEGO DELLA DIPENDENZA DELLA LEGGE DI DECADIMENTO ECC. 391 


Tra i processi unimolecolari, la conversione e degradazione interna di mole- 
cole eccitate è prevalente per basse densità di ionizzazione (8 energici), mentre 
i danneggiamenti permanenti del materiale prodotti inizialmente dalla parti- 
cella hanno importanza per alte densità di ionizzazione. 

I vari processi di quenching bimolecolari, dovuti ad interazioni mutue di 
molecole eccitate o ionizzate, sono favoriti dalla densità di queste nella scia 
della particella [2]. 

Attribuendo il quenching della fluorescenza a soli processi di tipo unimole- 
colare, si trova un buon accordo con la dipendenza sperimentale dell’efficienza 
di scintillazione da dÆ/dæ per elettroni, protoni, deutoni, «, di varie energie, 
in antracene [3]. 

In queste ipotesi, si prevede un decadimento dell’eccitazione e della inten- 
Sità di emissione di tipo esponenziale e con vita media proporzionale alla effi- 
cienza di scintillazione. 

Per x e per 6, ad esempio, si deduce dalle efficienze un rapporto tra le vite 
medie dell’ordine di 1/10, mentre sperimentalmente le durate delle scintilla- 
zioni sono dello stesso ordine anche per particelle con alta ionizzazione speci- 
fica. Non si può render conto di questo fatto semplicemente con processi di 
tipo unimolecolare. 

La dipendenza dal tempo della probabilità di diseccitazione delle molecole 
è stata da WRIGHT [4] attribuita a processi di quenching bimolecolari che, in 
aggiunta al quenching unimolecolare, competono, quanto più alta è la densità 
di eccitazione, con la emissione fluorescente. 

In questo caso, per le efficienze di scintillazione si hanno risultati analoghi 
a quelli precedentemente citati. 

L'intensità di emissione prevista ha un decadimento inizialmente rapido, 
che si converte, per tempi più lunghi, in un decadimento esponenziale avente 
la medesima costante di tempo per particelle con diversa ionizzazione speci- 
fica. Le vite medie complessive per particelle « e per elettroni risultano dello 
stesso ordine, in accordo qualitativo con l’esperienza. 

Per osservare la rapida emissione iniziale occorrerebbe un sistema con un 
tempo risolutivo dell’ordine di 1 ns. 

WRIGHT [5] ha studiato sperimentalmente le forme degli impulsi di scin- 
tillazione eccitati con « e con y in cristalli di antracene, stilbene, terfenile, 
con un metodo indiretto basato sul fatto che la potenza generata nella resi- 
stenza di carico dell’anodo di un fotomoltiplicatore, da una serie di impulsi 
di scintillazione dipende dalla forma di questi. 

Con una analisi dei risultati opportuna, WRIGHT ha dedotto forme di scin- 
tillazione composte di un breve picco iniziale (— 1 ns), e di una componente 
a decadimento lento. 

Nel caso dell’antracene questa contiene la massima parte dell'emissione, 
ed ha andamento esponenziale. 
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È assai significativo il fatto che la costante di decadimento è risultata 
apprezzabilmente maggiore per eccitazione con particelle « che per elettroni, 
rispettivamente 53 e 31 ns. 

Un effetto analogo è stato rilevato anche per lo stilbene [6]. 

Secondo l’interpretazione di WRIGHT, il picco di fluorescenza iniziale è 
emesso da una piccola frazione delle molecole inizialmente eccitate che deca- 
dono rapidamente per effetto dell’alta probabilità di quenching. 

La componente più lenta è emessa da una buona parte delle molecole ecci- 
tate che si formano per ricombinazione di molecole ionizzate con elettroni 
quando la densità di eccitazione è apprezzabilmente diminuita, e sono perciò 
meno soggette al quenching. 

Per alta densità di eccitazione iniziale, ad esempio con particelle x, le con- 
dizioni di quenching hanno maggiore durata; poichè l’emissione è soppressa 
maggiormente all’inizio, risulta, oltre ad una riduzione nella efficienza, anche 
un allungamento nella vita media apparente della componente lenta. 


2. — Parte sperimentale. 


Il metodo sperimentale che descriveremo è stato progettato allo scopo di 
discriminare particelle che, in relazione alla ionizzazione specifica prodotta nel 
materiale scintillatore, diano luogo ad impulsi di scintillazione di forma dif- 
ferente. 

Un requisito di notevole importanza pratica è che la discriminazione sia 
indipendente dalle ampiezze degli impulsi entro un largo campo. 

Abbiamo perciò adottato il seguente principio: considerato un impulso di 
scintillazione, la parte A, di esso raccolta dall'inizio sino ad un opportuno 
istante Ÿ è una frazione dell’ampiezza complessiva D, determinata unicamente 
dalla forma dell’impulso. Si potrà formare 
con A, e con D una combinazione lineare 
che sia di ampiezza nulla per tutti gli im- 
pulsi di un dato tipo e di ampiezza qual- 
siasi, che possono così essere selezionati. 

Nella realizzazione sperimentale (vedi 
Fig. 1), l'impulso di scintillazione è raccolto 
dal fotocatodo di un fotomoltiplicatore 
E.M.I. 6255 «a 13 stadi di moltiplica- 
zione. 


E.M.I. 6255 


-1500 +2000 V 


In seguito alla moltiplicazione si rac- 
colgono impulsi corrispondenti suli’anodo, 
Fig. 1. — Discriminatore di forma CON carica negativa, e sul penultimo dinodo, 

degli impulsi. con carica positiva. 
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Per evitare effetti di induzione tra questi, l’ultimo dinodo viene usato come 
schermo. 

L’impulso di tensione sull’anodo interferisce con la propria riflessione fuori 
fase generata da una linea cortocircuitata, dopo un ritardo 9 determinato dalla 
lunghezza della linea. 

La forma dell’impulso di tensione risultante viene rettificata per mezzo 
di un diodo a cristallo Philips OA5, che si chiude verso massa quando la ten- 
sione raggiunge valori positivi. 

Poichè la linea rimane allora chiusa su una resistenza di circa 1000 ohm, 
pari alla propria impedenza caratteristica, non si hanno riflessioni multiple. 

L’impulso di tensione nel ramo A ha forma corrispondente all’impulso ori- 
ginario sull’anodo, per ¢ da 0 a Ÿ, ed è circa nullo per tempi successivi. 

Dal penultimo dinodo l’impulso è prelevato mediante un condensatore di 
accoppiamento di 0.1 uk’. Attraverso la resistenza di carico variabile È, cia- 
scun impulso produce una tensione positiva, il cui valore integrato per tutta 
la durata dell’impulso è (1/R,):Q, essendo Q la carica totale raccolta dal 
dinodo. 

Gli impulsi di tensione in A e in D vengono integrati, attraverso le resi- 
stenze di 20 kQ e le capacità parassite, per un tempo variabile dell’ordine 
di 0.1 us, e vengono quindi mescolati. 

La resistenza variabile À, consente di regolare il guadagno nel ramo D 
rispetto al ramo A. 

L’impulso risultante viene formato attraverso una costante di tempo di 
integazione di alcuni ps. 

Gli impulsi in uscita dal discriminatore di forme sono inviati ad una comune 

‘atena di amplificazione lenta, costituita di un cathode follower e di un ampli- 
ficatore modello 100, e possono 
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un discriminatore a soglia ed 
una scala (vedi Fig. 2). 

L’apparecchiatura descritta 
è stata usata principalmente 
per distinguere impulsi di neu- Fig. 2. 
troni veloci di una sorgente di 
Po-Be, dal fondo y naturale o prodotto artificialmente con sorgenti di !*7Cs, Ra, Th. 

Da queste esperienze si sono ottenute informazioni sulle caratteristiche di 
emissione dei vari scintillatori adoperati. 

Lo scintillatore, collocato a contatto ottico con il fotocatodo, veniva irrag- 
giato in un primo tempo con la sola sorgente di 197Cs (~ 660 keV), gli impulsi 
risultanti venivano osservati all’oscilloscopio, e la resistenza variabile R, era 
regolata finchè non si osservavano più impulsi di segno positivo. 


Discriminatore 
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Amplificatore Discriminatore | 
di ampiezza peel 


Oscilloscopio 
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Nel circuito anodico del fotomoltiplicatore, gli impulsi che non sono almeno 
dell’ordine di un volt non vengono perfettamente rettificati dal diodo, tuttavia, 
data la particolare disposizione del circuito, se l’uscita corrispondente ad im- 
pulsi di una certa ampiezza è circa nulla, impulsi della stessa forma e di am- 
piezza maggiore danno uscite tutt'al più alquanto negative. Infatti anche ir- 
raggiando lo scintillatore con sorgenti di Ra (£,,,= 1.8 MeV) e di Th 
(E... 2.6 MeV) non si osservano, nelle stesse condizioni, impulsi positivi. 

Anche gli impulsi di fondo del fotomoltiplicatore davano uscita non posi- 
tiva in quanto più brevi degli impulsi di scintillazione. 

Con una piccola polarizzazione positiva del discriminatore di ampiezze si 
aveva, con scintillatori di piccole dimensioni (— $in.), un conteggio di 
(2--3) imp/min, praticamente indipendente dall’irraggiamento y, sino ad inten- 
sità corrispondenti a (10-105) imp/min. 

In queste condizioni, irraggiando lo scintillatore con i neutroni di una sor- 
gente di Po-Be, si osservavano impulsi positivi con ampiezza massima parec- 


TABELLA I. — Scintillatori per la discriminazione di neutroni veloci. 

I Salza PMI de ae ROUTE r'euRel REP 8 
| Efficien- | | 
| | | Efficienza 
| | Vita Menta PE 
i Numero | media Forma e | Plessiva | desi | 

À NOT Si 
| Tipo | di | discin- | volume DEE BI impulsi 
| | atomi H | tillazione | del |- troni | 197 Li | 
| Poe > | : della sor-| (Cs, | 
| per em (raggi y) campione | : Ra. TI 
| = | gente di | a, Th) 
| 
| | | Po Be | 
| | | | % % 
- a i | pda ul — — 
| | | | 
Antracene | 5.15-1022 30-35 Cubo ‘$i I BST 
| | | 2.05 em’ 

; eee (= | LIA] | 
Trans-stilbene jp SAT AO ~6 | Cubo 1 em?’ 6.5 » » 
Liquido «NE 202» | | | {Cilindro 13 » » 

(Nuclear 2in. X 2in. 
| 
n A LI N | | 
Enterprises | 51.5 em$ | 
Ltd.) i Dee S102) sel | 
| | ER 
| | Cilindro 4 Pes » 
| . . 
| | +in.X+in. 
| | | | 1.61 em? 
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Plastico «Sintilon Brand » (National Radiac Inc.). 
| Polistirene + 2.5% Terfenile + 0.03 % TPB (I.N.F.N. - Bologna). 
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chie volte maggiore rispetto alla polarizzazione del discriminatore, che veni- 
vano contati. 

I rendimenti ottenuti nelle condizioni sperimentali pit opportune, per neu- 
troni veloci della sorgente di Po-Be con scintillatori di vario tipo e volume sono 
riportati nella Tab. I (*), insieme con l’elenco di altri materiali scintillatori 
che hanno dato risultati qualitativamente comparabili ai precedenti. 

In ciascun caso le misure venivano ripetute impiegando linee con tempi 
di riflessione diversi, da 12.5 a 50 ns, e si ottenevano risultati dello stesso tipo. 


3. — Discussione dei risultati. 


Questi risultati indicano che nelle scintillazioni prodotte da protoni di rin- 
culo la frazione di luce emessa oltre un tempo Ÿ dell’ordine di qualche decina 
di mus è apprezzabilmente maggiore che nel caso di eccitazione con elettroni, 
per tutti i materiali elencati. 

Nel caso dell’antracene tale fenomeno concorda in sostanza con le esperienze 
di Wright, precedentemente discusse. 

Nel caso dello stilbene e dei vari sistemi liquidi e plastici, ai quali vengono 
attribuite vite medie di scintillazione più brevi di — 10-# s, gli effetti di discri- 
minazione tra protoni ed elettroni da noi osservati implicano l’emissione di 
componenti lente (che durano oltre — 50 mus), con intensità apprezzabile 
almeno nelle scintillazioni eccitate da particelle densamente ionizzanti, ma non 
necessariamente nel caso di elettroni veloci. 

L’esistenza di tali componenti lente trova riscontro nelle osservazioni di 
JACKSON e HARRISON [7,8] dei decadimenti di scintillatori organici eccitati con 
raggi X pulsati. Date le limitazioni sperimentali, essi potevano rilevare com- 
ponenti con costanti di tempo non inferiori a 0.1 us. 

Per gli scintillatori cristallini (antracene, stilbene, ...) e plastici sono ripor- 
tate componenti lente con vite medie poco più lunghe di 0.1 us insieme con 
altre di maggiore durata (alcuni us o più). Queste ultime sono attribuite da 
JACKSON e HARRISON a stati metastabili nelle molecole dello scintillatore. 

Con scintillatori liquidi non è stata rilevata alcuna componente lenta (con 
vita media < 0.1 us) di intensità valutabile, in quelle esperienze. 

In un confronto con questo risultato negativo, osserviamo che nel nostro 
caso l’emissione lenta qualitativamente rilevata per sistemi liquidi potrebbe 
essere riferita, oltre che al diverso modo di eccitazione, a componenti lente con 
vite medie anche inferiori a 0.1 us. 


(*) Questi valori sono riferiti all'intensità nominale della sorgente, che era data 
con + 15% di deviazione standard. 
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È verosimile che anche in scintillatori con vita media complessiva molto 
breve, le forme di scintillazione siano determinate in funzione della densità 
di ionizzazione iniziale per effetto di processi analoghi a quelli discussi nel caso 
dell’antracene. 

Nello stilbene, ad esempio, la ditl'usione degli ioni, che porta alla ricom- 
binazione di molecole in stati eccitati, non dovrebbe avere una durata sostan- 
zialmente diversa che nell’antracene, ed è plausibile che le condizioni iniziali, 
create dalla particella in relazione alla sua perdita di energia specifica, inter- 
vengano con peso anche maggiore nell’equilibrio fra le quantità di luce emesse 
nelle successive fasi della scintillazione. Infatti una parte importante della emis- 
sione dura solamente una prima decina di ns (essendo —6 ns la vita 
media misurata per elettroni), cosicchè quelle condizioni di quenching che 
agiscono prevalentemente in questa fase di alta densità di eccitazione si esau- 
riscono in un tempo breve rispetto alle componenti lente dell’emissione, che 
durano oltre — 50 ns per protoni. 

Concludiamo osservando che questi fenomeni meritano di essere studiati 
in modo quantitativo specialmente nei sistemi liquidi, in vista anche di nuove 
possibili applicazioni di questi scintillatori. 

Sarebbe utile tentare di estendere il metodo da noi seguito per discriminare 
il fondo y nei rivelatori a liquido caricato con boro, che sono tuttora gli unici 
con buona efficienza per neutroni di energia intermedia, e, in generale, per 
discriminare il fondo y in misure di conteggio interno con emettitori «. 

In questi casi è da prevedere una limitazione imposta dalla bassa sta ti- 
stica che determina le forme effettive di impulsi x di bassa energia. 

Alle alte energie, la buona determinazione statistica delle forme di impulso 
potrebbe consentire la selezione di particelle che diano luogo a forme di emis- 
sione alquanto differenti, in relazione a diversità di ionizzazione specifica o 
di distribuzione della ionizzazione secondaria lungo la traccia che influiscono 
sui processi di quenching e di ricombinazione degli ioni. 

A titolo indicativo, riportiamo che con uno scintillatore liquido cilindrico 
di 2in.x2in. il conteggio di fondo, in assenza di neutroni, aveva una fre- 
quenza costante superiore rispetto ad un rivelatore della stessa natura e di 
picecle dimensioni, e tuttavia indipendente dalle condizioni di irraggiamento 
anche intenso con y di energia fino a ~ 2.6 MeV. 

Tali impulsi, positivi all’uscita del discriminatore di forme e con ampiezze 
considerevoli, si devono presumere dovuti a particelle della radiazione cosmica, 
benchè non si abbiano sufficienti informazioni sperimentali sulle caratteristiche 
che ne determinavano la discriminazione dagli impulsi y. 


— 
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Si riferisce su risultati di misure del tempo di rilassamento termico 7, dei 
protoni in alcuni liquidi a varie temperature. 

Il metodo di misura si basa sull’osservazione dei segnali dovuti a passaggi 
rapidi adiabatici dei nuclei attraverso la risonanza. Il dispositivo sperimentale, 
già descritto altrove [1], è stato da noi perfezionato allo scopo di poter variare 
e misurare con sufficiente precisione la temperatura del campione in esame. 

Per la variazione della temperatura noi usiamo una corrente di azoto, la 
quale viene fatta passare attraverso un riscaldatore o un refrigeratore e viene 
quindi inviata nella stretta intercapedine compresa tra la parete cilindrica 
della fiala contenente il liquido e il supporto della bobina ricevente. La misura 
della temperatura viene effettuata per mezzo di una termocoppia rame-costan- 
tana, una saldatura della quale, opportunamente protetta, si trova nell’in- 
terno della fiala. La precisione nella lettura della temperatura è dell’ordine 
di 1/10 di grado; le differenze di temperatura fra i diversi punti del campione 
risultano dello stesso ordine di grandezza, se si fa in modo che le variazioni 
della temperatura siano abbastanza lente. 

Col metodo ora descritto noi abbiamo fino a questo momento misurato il 
tempo di rilassamento dell’acqua per un intervallo di temperatura tra — 10 °C 
e +90 °C circa e quello dell’alcool etilico per un intervallo di temperatura tra 
+25 °C e +70 °C circa. I risultati sono esposti nelle Fig. 1 e 2. 

Dall'esame della fig. 1 si può osservare che il tempo di rilassamento del- 
l’acqua da noi misurato per una temperatura di 27 °C (300 °K) corrisponde 
al valore di 3.48 calcolato per la stessa temperatura dalla teoria di Bloem- 
bergen, Purcell e Pound [2, 3] con una precisione forse superiore a quella che 
ci si potrebbe attendere. 


In generale, noi possiamo aspettarci che la teoria dia risultati in accordo 


(*) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 22-28 Settembre 1957. 
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bj AG) . . . pile . FE 2 
con l’esperienza solo nei casi in cui il modello di Debye, adottato per la descri- 
zione dei movimenti delle molecole, rappresenti una sufficientemente buona 
approssimazione. 
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Hie els — Tempo di rilassamento 7, dell’acqua a varie temperature. 


In particolare, in base alla teoria ci si può attendere che la quantità T,7/T 
(n= viscosità, T— temperatura assoluta) sia indipendente dalla temperatura. 
Le Fig. 3 e 4 mostrano che, per l’acqua e per l’alcool etilico, si hanno sensibili 
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Fig. 2. - Tempo di rilassamento 7, dell’alcool etilico a varie temperature. 
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deviazioni da questa legge. Il fatto che 77,7/7 diminuisce al crescere di 7 non 
sembra in accordo con l’ipotesi che in questi liquidi esistano ben definite asso- 
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Fig. 3. — T,n/T per l’acqua a varie temperature. 


ciazioni molecolari, ma piuttosto con l'ipotesi dell’esistenza di una struttura 
quasi-cristallina in progressiva distruzione al crescere della temperatura. 
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Fig. 4. — Tin/T per l’alcool etilico a varie temperature. 


Nel caso dell’acqua si può notare che T,7/T si mantiene circa costante nella 
regione tra — 10 °C e +10 °C. Il confronto con altri liquidi in prossimità del 
punto di fusione potrà permettere di stabilire se questo andamento è da attri- 
buirsi al particolare comportamento dell’acqua in prossimità di +4 °C. 
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1. - Stato attuale dei lavori. 


Già in occasione dell’ultimo congresso della Società Italiana di Fisica a 
Torino abbiamo dato particolari sul nostro progetto [1], ed in questa relazione 
passiamo quindi in rassegna l’attuale situazione dei lavori, sottolineando alcuni 
contributi particolari su determinati problemi. Riferiamo poi brevemente sulla 


attuale situazione per la preparazione delle esperienze. 


Nella tabella I è riportata la attuale distribuzione dei compiti tra i fisici e 


TABELLA I. — Attuale distribuzione dei compiti tra i fis 
a ee È ; I 7 i = 
Magnete Radiofrequenza Vuot | 
. 110 . 
| e misure ed k bt : + Iniettore | Gruppo 
i magnetiche elettronica circ. | ten 
AMMAN ALBERIGI CORAZZA AGENO | BERNARDIN 
| | 
BOLOGNA MASSAROTTI SIRCANA BIZZARRI | SONA i 
DIAMBRINI MONTELATICI QUERZOLI | TURRIN 
GHIGO PUGLISI REALE | 
SACERDOTI QUERCIA 
SALVINI 
SANNA | 
(L) 7 5 2 4 3 
(a) y 11 2 = 


(*) Lavoro presentato al XLIII Congresso Nazionale della Società Italiana di 
Fisica, Padova-Venezia, 
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gli ingegneri dei Laboratori di Frascati. Il personale della Sezione Acceleratore 
al completo si è trasferito dall'Istituto di Fisica di Roma ai Laboratori di 
Frascati nel mese di Giugno di quest'anno. Attualmente si può considerare 
superata la fase di trasloco e di primo ambientamento, anche se buona parte 
dei laboratori e dei servizi sono alloggiati in sedi non definitive, in attesa 
dell’ultimazione del la- 


boratorio a tre piani 
attualmente in costru- | 
2 Condensatori 
zione. J 
: SM DT N 
RE RE 
Sala < > Gala 


Esperienze 
Magnete. Macchine je] 


Per i dati caratte- 
ristici sul magnete ri- 
mandiamo alle nostre 
pubblicazioni [1]. 

In Fig. 1 si vede 
la sala dell’edificio per 
il Sincrotrone (Scac- 
CIA), nella quale è posto il magnete e dove si svolgeranno le esperienze. 

In Fig. 2 è riportata una sezione del nostro magnete (SACERDOTI) limitata- 
mente ai particolari che questa relazione permette. 


i 
gli ingegneri dei Laboratori di Frascati. 
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[zo + 0 | È stata ultimata la costruzione 
ee ee gee dei 64 blocchi di lamierini (ciascu- 
no di 30 cm di dimensione azimu- 
tale) che costituiscono il nucleo del 
magnete. I blocchi sono ‘stati col- 
laudati geometricamente e misurati 
magneticamente presso la ditta co- 
struttrice ed è iniziato il montaggio 
del magnete (Guico). È ultimata 
anche la costruzione delle espansio- 


960 


ni polari. 

È arrivata ai Laboratori la pri- 
ma delle quattro bobine necessarie 
per i quattro quadranti. Essa è sta- 
ta preparata con la normale tecnica 
di compoundatura, oltre che ulte- 
riormente nastrata a freddo nell'insieme dei conduttori interni. È raffreddata 
ad acqua ed ha una struttura per necessità non usuale (GHIGO). 

Come è noto il magnete sarà eccitato in corrente alternata (20 p/s) più 
corrente continua. È pertanto necessario un induttore di protezione (ToscHI). 
Questo induttore è senza nucleo di ferro, in una struttura suggerita dal prof. So- 
MEDA dell’Università di 
Padova. L’induttore è 
attualmente in fase di 


R = 3600 


Bigs. 


| 


impregnazione e la sua ===} à 


| 
| 


È 


consegna è prevista en- 
tro un mese da oggi 
(in Fig. 3 è indicato 
lo schema del nostro 
induttore di protezio- 


/nauttore di protezione 


ULI 


ne). 

I banchi di conden- 
satori per il magnete 
sono completamente 
montati. 

Nei prossimi giorni 
si installerà il quadro 
di controllo. Il gruppo convertitore è quasi completo poichè sono montati il 
trasformatore, i due gruppi rotanti, gli interruttori, le sbarre di collegamento 
ed il quadro (AMMAN). Dovranno ancora iniziare la messa a punto dei re- 
golatori e le prove delle macchine su un carico resistivo, che dureranno pre- 
sumibilmente circa tre settimane (AMMAN). 
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A magnete completamente montato vi sarà una ulteriore messa a punto 
dei regolatori sul carico reale. 


Bobine di correzione. 


Si è iniziata la costruzione definitiva delle varie bobine di correzione che 
serviranno per la definizione fina del campo magnetico (AMMAN, .CORAZZA, 
SANNA). Queste bobine hanno chiesto uno studio piuttosto elaborato, e per 
questo è stato montato a Frascati il pezzo di magnete della lunghezza di un 
metro costruito a suo tempo. Su questo magnete sono stati sperimentati in 
eccitazione continua gli effetti dei 


circuiti di alimentazione della re- n 
gione focalizzante del campo e si La 
è giunti, dopo numerose prove, 14 
alla selezione di una serie con- / 
veniente di fili ed a valori di ’° 
correzione ormai quasi definiti- 10 
vi [2]. 

Sulla base dei risultati speri- # 
mentali ottenuti in corrente con- x 
tinua è stata precisata l’alimen- i 


tazione di questi circuiti ed è 
stata quindi avviata la costruzio- 02 
ne dell’alimentazione occorrente 

per tutto il magnete. 

Attraverso numerose serie di -02 
misure sono state poi effettuate 
opportune scelte di insieme dei 
fili che realizzano le correzioni 
di n di tipo prefissato anche in 
corrente alternata. È stata col- 
laudata con successo l'alimentazione a questo scopo progettata e costruita. 

In Fig. 4 diamo i risultati delle misure in c.a. del gradiente n del campo 
su tutta la profondità della gap quando non vi sono bobine di correzione 
(curva 1), e per diverse correnti (con un sistema di 8 fili appoggiati alle espan- 
sioni polari), che hanno lo scopo di ruotare, nel modo voluto e indicato nel 


diagramma, il valore del gradiente. 


Misure magnetiche. 


È stato ultimato, come si è detto, il nostro programma di misure magne- 
tiche su ciascuno dei 64 blocchi che costituiscono il magnete (BOLOGNA, DIAM- 
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BRINI, GHIGO). Queste misure sono state eseguite al campo di iniezione, a campo 

residuo, a campo medio ed a campo massimo. 

Nella Fig. 5 è riportata la distribuzione dei valori del campo massimo 
tra i vari blocchi, per una deter- 
minata eccitazione. 

Questi risultati, considerate le 

nostre condizioni di eccitazione, si 

(pacchr sinistri) possono considerare soddisfacenti. 
I risultati così ottenuti sono sta- 

ti elaborati dal nostro gruppo 

teorico in collaborazione col grup- 


À po di misure magnetiche a fine 


[ di stabilire l’ordine migliore con 
(pacchi destri) È 


cui i blocchi dovranno essere al- 
lineati sul Sincrotrone (BERNAR- 
DINI, SONA, TURRIN). 

Si è quindi calcolata la forma 
dell’orbita chiusa relativa alla di- 
stribuzione dei blocchi decisa, e 

Fig. 5. nell’ipotesi che le fluttuazioni azi- 

mutali di campo magnetico siano 

essenzialmente dovute ai blocchi più che alle espansioni polari. La forma del 

l’orbita chiusa è indicata nella Fig. 6. Come si vede l’orbita chiusa si scosta 
dall’orbita ideale di equilibrio per non più di 2 mm a campo massimo. 

Dai primi di settembre è iniziata la programmazione delle misure magne- 


—Iniettore 


Fig. 6. 


\ 


tiche sul primo quadrante del sincrotrone e si sta montando e mettendo a 
punto tutta l’apparecchiatura necessaria. 

La nostra strumentazione per le misure magnetiche si è dimostrata sod- 
disfacente, ed anzi può dirsi che essa è probabiimente la più avanzata oggi 
esistente per misure su macchine nucleari. Tra gli strumenti originali o le in- 
novazioni apportate a queste misure, vogliamo ricordarne alcuni. 
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Lo strumento approntato da DIAMBRINI per le misure del piano magnetico 
mediano in corrente continua è ormai in uso presso di noi da più di un anno 
e soddisfa egregiamente alle sue funzioni. Tale strumento è già stato descritto [3]. 

Si è studiato un nuovo ti- 


po di bobinetta rotante (GHI- —30 —4 
GO) per la misura del gradiente 
n di campo da 500 a 10000 IL \ 4 


gauss. L'apparecchio consiste 
di una bobina con l’asse ma- | 
gnetico ortogonale al piano ira 
magnetico mediano, e che ruo- 


ta intorno ad un asse anch’esso 

ortogonale al piano magnetico dI 6) 
mediano, come descritto nella Bigs 75 

Fig. 7. Il moto di rotazione 

della bobina è ottenuto mediante una turbina a palette ad aria compressa. 


Facendo ruotare la bobina nelle condizioni di cui sopra, in un campo ma- 
gnetico con gradiente b si ha una forza elettrica alternata il cui valore di 


picco è: 


e = 2(B, — B,)2xv, 


dove » è l’area spire della bobina e B, — B, sono i valori del campo ad una 
distanza 2r, essendo r la distanza tra l’asse magnetico e l’asse di rotazione. 
La misura di B, — B,, dalla quale si risale al valore di n, può realizzarsi indi- 
pendentemente dalla frequenza », integrando il segnale di uscita. Lo schema 
a blocchi dell'apparecchio è indicato in Fig. 8. 

Le misure di » all’inieizione, in eccitazione alternata più continua, sono 
state fatte impiegando il 
metodo delle peaking strips 
[3]. 

Si sta studiando anche 
RES de ets Vautomazione di queste mi- 

di picco sure per guadagnare tempo 
| intorno al magnete. Si è 
concluso per una ragionevo- 

le automazione che verrà à 

suo tempo descritta. Rite- 

niamo infatti che sia conveniente pubblicare in una nota estesa la descri- 
zione degli strumenti di misura da noi costruiti e le misure magnetiche effet- 


Fig. 8. 


tuate. 
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Iniettore. 


L’iniettore, come gia noto dalle nostre relazioni [1], (pag. 1), è del tipo di 
Cockroft e Walton. Esso è curato dai fisici dell’Istituto Superiore di Sanità 
di Roma. 

Il moltiplicatore di tensione è completo in tutte le sue parti. Sono in questo 
periodo in corso le prove di collaudo ad alta tensione del dispositivo montato 
nella sua tank in azoto compresso a 12 atm. Si prevede che tali prove dure- 
ranno ancora una decina di giorni dopo di che tutto il complesso sarà traspor- 
tato a Frascati per la messa in opera. L’alimentatore a 1000 periodi stabi- 
lizzato è già stato collaudato e trasportato a Frascati. 

I catodi e la sorgente ed il tubo acceleratore sono pronti; la sorgente ed i 
singoli elementi del tubo acceleratore sono stati collaudati alla loro tensione 
di lavoro su di un banco da vuoto provvisorio. 

Il banco da vuoto è stato automatizzato ed il suo collaudo è ora in corso. 
Si sta provvedendo al montaggio su di un carrello indipendente dalla pompa 
a diffusione. 

I circuiti per il comando da terra della sorgente di elettroni sono pronti e 
collaudati. È in corso il montaggio definitivo dei circuiti di comando dei fila- 
menti del catodo di comando della sorgente destinati a stare nella parte alta 
tensione del moltiplicatore; anche questi circuiti sono stati collaudati in mon- 
taggio di prova. 

Sono state già progettate ed eseguite le lenti di focalizzazione del fascio 
di elettroni ed i relativi alimentatori stabilizzati in corrente. Sono progettate 
o in corso di esecuzione le bobine deflettrici per la centratura del fascio. 

Il deflettore, insieme a tutti i meccanismi che ne permettono i movimenti 
è già costruito. È quasi ultimato il pannello che contiene tutti i relativi comandi. 
È iniziata la costruzione del giunto mobile di collegamento fra il deflettore 
e l’iniettore. È stato costruito in montaggio di prova un alimentatore altamente 
stabilizzato a 50 kV per il deflettore, che è attualmente in collaudo; contem- 
poraneamente è stato cominciato il suo montaggio definitivo. 


Vuoto e ciambella. 


L'impianto di vuoto dell’elettrosinerotrone sarà pronto per il montaggio 
entro questo mese. 

Presso i nostri laboratori è ora in via di realizzazione il pannello dei tele- 
comandi dell’intero impianto di vuoto da montarsi sulla consolle di comando 
generale del Sincrotrone. 

Dal 1° Settembre 1957 si è cominciata la costruzione della ciambella nelle 
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Sue dimensioni definitive apportando al vecchio modello aleune modifiche che 
la rendono meccanicamente più resistente e di maggiore semplicità di mon- 
taggio (CORAZZA, SIRCANA). 
Da prove fatte in un compressore idraulico la ciambella resiste senza rom- 
persi ad una pressione di 2.5 atm. La rottura avviene a circa 3 atm od oltre. 
Ogni quadrante di ciambella sara composto di tre settori uniti fra loro 


con una guarnizione a tenuta di vuoto tipo « O » ring alloggiata sulla testata 
di ogni settore in una gola ricavata per mezzo di un pantografo. 

In ogni quadrante la ciambella ha cinque finestre trasparenti che permet- 
tono di esplorare l’interno. In prossimità di tali finestre sono sistemate bandie- 
rine fluorescenti normalmente tenute in posizione tale da non disturbare il 
percorso degli elettroni. Le bandierine per mezzo di un sistema meccanico à 
tenuta di vuoto possono essere poste in direzione normale al fascio in modo 
da individuarne la posizione. Tali bandierine serviranno principalmente alla 
ricerca del fascio nel periodo di messa a punto dellelettrosincrotrone. 

In Fig. 9 è illustrata la sezione della ciambella. La ciambella è coperta 
all’interno con uno strato di acciaio inossidabile dello spessore di 5/100 di mm 
in striscie di 12 mm distanti tra loro 3/10 di mm. 


Radiofrequenza. 


È stata ultimata la costruzione, messa a punto e collaudo della catena di 
amplificazione per la radio frequenza (RF,) (ALBERIGI, LEPRI, PUGLISI, QUER- 
CIA). Si sta ora precisando l’inserimento della cavità risonante nella sezione 
diritta della macchina ad essa riservata, ed il passaggio della ciambella attra- 
verso di essa. Sono in costruzione i dispositivi di sicurezza e gli apparati per 
la messa in funzione automatica di ogni parte di questo impianto. È già stata 
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pubblicata in una delle nostre relazioni la descrizione particolareggiata della 
parte elettronica della RF, [4]. 

È stato risolto (QUERCIA [5]), con un dispositivo elettronico del quale diamo 
qui il principio di funzionamento, il problema di controllare la frequenza della 
cavità RF,, mentre essa varia rapidamente nel tempo: come è noto questa 
variazione è necessaria perchè gli elettroni all’iniezione (2.5 MeV totali) hanno 
una velocità iniziale inferiore del 2.1% alla velocità della luce. 

La misura della frequenza deve essere fatta in un tempo sufficientemente 
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Fig. 10. 


breve affinchè il valore di frequenza sia sufficientemente definito (almeno entro 
Puno per mille). 

Questo è stato realizzato confrontando ad ogni tempo voluto, con un si- 
stema di coincidenze, l’istante 7, in cui la frequenza », della cavità raggiunge 
un certo valore e l’istante 7, in cui il campo magnetieo ha il valore che a quella 
frequenza deve corrispondere. I due istanti devono essere in coincidenza. La 
frequenza v, viene direttamente prelevata mediante un segnale captato con 
una piccola antenna ed iniettato entro un circuito risonante avente l’opportuno 
fattore di merito ed accordato sulla frequenza »,. 

Diamo in Fig. 10 lo schema a blocchi di questo strumento. 

RF, (cavità risonante ad alta tensione). 

li elettroni vanno portati da circa 10 MeV a 1000 MeV per mezzo di 
una cavità risonante ad alta tensione (circa 60000 V,,., alla gap), della quale 
è terminata la progettazione nelle sue linee generali ed è stata iniziata la 
costruzione e messa a punto delle varie parti della catena (PUGLISI, MASSA- 
ROTTI). 

Lo stadio pilota (uscita 300 W alla frequenza finale) è stato costruito e 


collaudato; così pure il 1° stadio (uscita 1.3 kW) che però non è ancora in 
veste definitiva. 
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Lo stadio prefinale (10 kW) è parzialmente costruito e sarà provato in 
questi giorni; lo stadio finale (40 kW) è ancora in fase di modello in scala 1:1. 

Si è costruita una cavità risonante con 
la regione elettrica sotto vuoto e su di essa 
sono stati sperimentati i metodi per evitare 
il multipacting elettronico. Con queste cavità 
preliminari si sono ottenuti circa 25000 V,,,.. 

È attualmente in costruzione la cavità di 
tipo definitivo. Diamo in Fig. 11 lo schema 
in sezione di questa cavità. 

In un primo tempo si era conside- 
rato sufficiente una tensione massima di 
(30000 —40000) V. Questa cifra è stata oggi 
portata a 60000 V sulla base delle esperienze 
fatte recentemente presso altri sincrotroni e Con 
dei risultati teorici di vari autori [6]. 

Il nostro gruppo teorico (C. BERNARDINI) 
ha calcolato la tensione di R.F. massima ne- 
cessaria per la nostra macchina. La situa- 
zione è illustrata nel diagramma di Fig. 12. 
In questo -diagramma si dà ampiezza massi- 
ma di oscillazione di Sincrotrone, per varie 
tensioni massime della cavità, in funizone 
dell’energia degli elettroni. 

Affinchè non si perda una elevata percentuale degli elettroni è necessario 
che l’ampiezza massima delle oscillazioni permesse sia abbastanza più grande 
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delle dimensioni quadratiche medie che il fascio raggiunge per effetto delle 
fluttuazioni nella perdita di irraggiamento degli elettroni. 

Come si vede, questa condizione è rispettata sino a 1000 MeV solo se la 
cavità ha una tensione massima di almeno 50000 V. Si stima che per il 
nostro elettrosincrotrone sia conveniente una cavità di almeno 60000 V. 

In questi mesi si è calcolato [7] (PUGLISI, Quercia) il calo di tensione 
massimo della cavità RF, per diverse correnti di elettroni circolanti entro la 
ciambella. Le cavità da noi costruite e la potenza dei nostri impianti permet- 
teranno certamente di accelerare un fascio di 10! elettroni senza apprezza- 
bili perdite di tensione della cavità. 


Estrazione del fascio di elettroni. 


Si è fatto uno studio preliminare della possibilità di estrarre il fascio di 
elettroni dalla macchina allo scopo di fare esperienze con esso, oltre che con 
il normale fascio di fotoni. Queste ricerche hanno già indicato che questa ope- 
razione presenta varie difficoltà e non si è ancora trovata una via di sicuro 
SUCCESSO. 

In particolare (DIAMBRINI) si è esaminata l'eventualità di migliorare il 
rendimento di estrazione del fascio di elettroni disponendo due zone di campo 
magnetico alternato, una interna e l’altra esterna all’orbita stabile. In tal 
modo gli elettroni potranno entrare in queste regioni già con fase di risonanza 
e ad ogni giro. Questo meccanismo dovrebbe diminuire lo sparpagliamento 
degli elettroni stessi, che potranno così raggiungere il campo magnetico in 
maggior numero che non nel caso del metodo tradizionale di una sola regione 
alterata. 


2. — Preparazione delle esperienze con l’elettrosinerotrone. 


Lo sviluppo della preparazione delle esperienze ha proceduto secondo le 
indicazioni già contenute nel verbale della riunione svoltasi a Roma nel Gen- 
naio 1957 [8]. 

Non riteniamo pertanto opportuno riportare in questa sede i particolari 
progetti in corso e ci limitiamo ad alcune osservazioni generali. 


Strumenti per la misura del fascio. 


Si tratta di fare una misura assoluta dell’intensità del fascio dei y e di 
misurare sperimentalmente la distribuzione in energia dei fotoni che lo costi- 


LE >, De. 
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tuiscono. Per la misura dell’intensità si stanno costruendo (CIALDEA) gli oppor- 
tuni calorimetri, che misurano l’energia totale del fascio che in essi si estingue 
completamente. 

È inoltre in corso di progetto una camera a ionizzazione del tipo già costruito 
presso l’Università di Cornell [9]. 

Per lo spettro del fascio y è in progetto uno spettrometro a coppia (BENE- 
VENTANO, RISPOLI, SANNA). Il progetto del magnete per esso è particolar- 
mente oneroso e si sta tentando una soluzione che permetta di impiegare questo 
magnete nelle esperienze, una volta che il suo impiego con lo spettrometro 
a coppie sia esaurito. 


Magneti per le esperienze. 


Sono attualmente in costruzione due magneti deflettori, illustrati in Fig. 13. 
Come si vede si è scelto un magnete con giogo a C per lasciare liberi tre lati 
del traferro, poichè si ritiene che questo permetta una maggior latitudine di 
impiego. Per questi magneti è previsto un campo massimo di 16000 gauss, e 
l’area del traferro di 80x40 cm? permetterà deflessioni apprezzabili sino a 
1000 MeV. 


Si sono studiate le più convenienti forme polari per il focheggiamento delle 
particelle cariche. Sono previste espansioni polari a campo medio ed espan- 
sioni polari a focheggiamento forte. 

È sempre conveniente appoggiare le attrezzature generali a determinati 
progetti di esperienze, anche se questi progetti, per il tempo che intercorre 
tra il progetto e la possibile esecuzione delle misure, dovranno essere note- 
volmente mutati. Pertanto, i magneti ora in costruzione sono quelli adatti 
ad una esperienza di fotoproduzione di mesoni carichi con un fascio y di energia 
massima di 1000 MeV [10]. 
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Aleune osservazioni sulla distribuzione dei lavori. 


È importante tener presente che il Sincrotrone produrrà scientificamente 
non quando sarà finito ma solo quando sarà pronta insieme ad esso l’attrez- 
zatura generale per le esperienze, oltre all’elettronica, ed ai rivelatori di varia 
natura relativi a determinate esperienze. 

Come si è visto (cfr. tabella I) è ormai sufficiente il numero di fisici e di inge- 
gneri che lavorano alla macchina propriamente detta, ma è certamente troppo 
ristretto il numero di fisici che preparano le attrezzature generali o determi- 
nate esperienze. 

Dobbiamo tener presente che il nostro elettrosincrotrone è stato costruito 
per permettere di svolgere ricerche nucleari nel campo delle alte energie ai fisici 
‘interessati di tutte le Università italiane, e che a questa macchina, perchè la 
costruzione sia giustificata, dovranno avvicendarsi con continuità diversi gruppi 
di ricercatori per diverse esperienze. 

È difficile dire quando la macchina sarà funzionante poichè inizierà tra 
poco quella fase di messa a punto, sulla cui durata non ci si può impegnare. 
Debbo dire però che è opinione anche degli esperti con i quali abbiamo scam- 
biato che, al punto in cui oggi siamo, noi dovremmo avere un numero di fisici 
che si dedichino alla preparazione delle attrezzature generali e delle esperienze 
non molto inferiore al numero di fisici che si dedica direttamente alla mac- 
china. Attualmente invece i fisici che dedicano la loro maggiore attività alla 
preparazione delle attrezzature e delle esperienze non supera il numero di sei. 

È a mio parere possibile, ma non nel breve tempo ancora rimasto per questa 
relazione, dimostrare che i limiti di precisione della sperimentazione nucleare 
almeno con un elettrosicrotrone, non sono oggi tanto costituiti dalla precisione 
e ripetibilità della macchina quanto dalla precisione degli strumenti calibra- 
tori del fascio e dei rivelatori delle interazioni nucleari. 

Sotto questo punto di vista si può dire che un elettrosinerotrone merita 
uno sforzo sulle attrezzature generali e sui rivelatori di ogni esperienza, mag- 
giore di quanto non si faccia anche oggi in ogni laboratorio ove sono funzio- 
nanti elettrosincrotroni. La nostra situazione in questo senso rischia di essere 
anche peggior di quella di altre macchine già funzionanti. 


Elettronica per le esperienze. 


Da circa sei mesi è in funzione (RIspoLI) un Centro di elettronica che si 
occuperà particolarmente di realizzare sia gli strumenti elettronici per le espe- 


rienze, come i rivelatori propriamente detti (contatori di Cerenkov, scintil- 
latori, ecc.). 
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Si è considerata questa iniziativa necessaria, dato l’alto livello di specia- 
lizzazione raggiunto dall’elettronica. D’altra parte questo Centro dovrà realiz- 
zare alcuni prototipi che possano essere dati all’industria italiana per la loro 
moltiplicazione. 

Considerato l’attuale stato dei lavori, è da augurarsi che ben presto inizi 
a Frascati la preparazione delle esperienze con la nostra macchina, da parte 
dei fisici delle nostre Università. 

Uno stretto contatto tra chi fa la macchina ed i fisici che preparano espe- 
rienze è ormai, a mio giudizio, necessario. 
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Introduction. 


Recent experiments (‘) on strange particles have given some support for 
the possibility that elementary particles of spin higher than 1 may exist. 
In particular the observed angular correlation between the planes of production 
and decay of the hyperon A and Y, and the anisotropy of the angular distri- 
bution of the & decay products, seem to indicate that the spins of these part- 
icles are 3 or higher (?). 


(*) Part of dissertation submitted in partial satisfaction of the requirements for 
the degree of Doctor of Philosophy from the University of California at Los Angeles. 

(*) Now a CERN Fellow, at Geneva. 

(1) W. D. Waker and W. D. SHEPHARD: Phys. Rev., 101, 1810 (1956); W. B. 
Fow er, R. P. ScHurT, A. M. THorNnDYKE and W. L. WHITTEMORE: Phys. Rev., 91, 
1287 (1953); 93, 861 (1594); 98, 121 (1955); L. W. ALVAREZ, H. BrADNER, P. FALK- 
VAIRANT, I. D. Gow, A. H. RozenrELD, F. T. SOLMITZ and R. D. Tripp: UCRL 3583, 
University of California Radiation Laboratory, Berkele¥. 1956. 

(*) At the time that the present work is being prepared for publication, evidence 
for the higher spin of the hyperons is weakening, see Proceedings of the Rochester Con- 
ference of High Energy Physics, 1957 (to be published). 
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The theory of higher spin fields proposed by FIERZ (*), and FIERZ and 
PAULI (*) has been simplified by the work of RARITA and ScHWINGER (5), 
GUPTA (°) and MOLDAUER and CASE (7). In spite of these simplifications, 
however, calculations are very lengthy, as illustrated by the calculation of 
the Compton scattering cross-section by MATTHEWS (8). 

It is the aim of this note to present a new simple formulation of the Fierz- 
Pauli theory, and to analyze in some detail the structure of this theory for 
arbitrary spin. 

All relativistic theories of free fields are based on the principle of invariance 
under the group of co-ordinate transformations known as the Lorentz group (°). 
This principle requires that the wave function form a basis for a representation 
of the Lorentz group. The simplest kind of field is defined by an irreducible 
representation and is said to describe an elementary particle, which provides 
an exact and natural definition of the latter concept. 

The Lorentz group possesses two invariants 


2 = PDP, 3 
2152 == tLe Le SE LL ,3p,p° x 


In an irreducible representation these must be multiples of the identity. 
Define, therefore, two numbers m and s, such that 


(a) po=—m?, (Definition of mass), 


(b) Da s(¢ 1), (Definition of spin). 


In all the cases of physical interest m is a positive number or zero, and 2s 
is a positive integer or zero. In these cases there is only one representation of 
the Lorentz group for any given set of values of m and s (1), apart from an 
ambiguity in the choice of reflection operators (1). In the following the value 
m= 0 is excluded from consideration, unless otherwise stated. 

In the first three sections conditions (a) and (b) are formulated in terms 
of conditions on the wave function, in the case of no external forces. A «spin 


(3) M. Frerz and W. PauLI: Proc. Roy. Soc., A178, 211 (1939). 
(5) W. RARITA and J. SCHWINGER: Phys. Rev., 60, 61 (1941). 
(6) S. N. Gupra: Phys. Rev., 95, 1334 (1954). 
(7) P. A. MotpavER and K. M. Case: Phys. Rev., 102, 279 (1956). 
(8) J. MATTHEWS: Phys. Rev., 102, 270 (1956). 
(9) The short term «Lorentz group» is used for the « Extended inhomogeneous 
Lorentz group », which includes translations and reflections. 
(19) E. P. WiGNER: Ann. of Math., 40, 149 (1939). 
(11) L. L. FoLpy: Phys. Rev., 102, 568 (1956). 
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projection operator » is introduced, which greatly simplifies the treatment of 
the subsidiary conditions. In Sect. 4-9 the Fierz-Pauli theory for electro- 
magnetic interaction of particles of arbitrary spin is analysed, with the aid 
of the spin projection operator. 

In the last sections polarization operators are introduced and applied to 
the calculations of angular distribution of hyperon decay products. 


1. — The free fields. 


The simplest irreducible representation of the Lorentz group is that for 
which s=0. The wave function is a single function of p, and satisfies the 
condition (a) 


(1.1) (p? + m*)p(p) = 0. 


This is recognized as the Klein-Gordon equation. 

Four functions y,(p), which transform like the components of a vector, 
provide the basis for a reducible representation. Designating this represen- 
tation by D, we have in fact 


D = D(1)® DO), 


where D(s) is the irreducible representation corresponding to the spin value s. 

Irreducible representations corresponding to any other integral spin value 
may be formed by taking direct products of D with itself and expanding the 
products in Clebsch-Gordon series. For example 


D @ D = D(2)® 3D(1)@ 2D(0). 


The quantity that transforms according to D‘ — D @ D @ … @ D is the tensor 
of rank s. The irreducible representation D(s) is (2s+1) dimensional. The 
wave equation (1) applies to each component of the wave function, while con- 
dition (b) requires that the projections of g on the spaces of representations 
of lower spin values vanish. When s = 2, this condition may be written 


n ns 
p ‘Pipa x 0 ’ 
1.2 Pe 
( 2) Put che Prati ’ 


My Ha = 
I Patte = 0 ? 


where ,,,, is a tensor of rank 2. Equations (2) are equivalent to Eq. (6), and 
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are referred to as subsidiary conditions (2). Similarly the general integral 
spin field is defined by the following set of equations: 


(1.3a) (p? + MVP an = 0» 
(1.3b) Ven sain in 
(1.3c) POO un = 0: 
(1.34) dor =. 0.5 


where s is the spin. 

If the spin is s=n-+4, n integer, the wave function is a tensor of rank n, 
each component of which is a Dirac four spinor. 
Equation (3a) is replaced by the Dirac equation (18) 


(1.3a*) (Pt MP = 0; 
and Eq. (34) by the subsidiary condition 
(1.3d*) Pike AT a 0 9 


Equation (3a) is a consequence of (3a*), and the only additional information 
contained in (3a*) concerns the choice of reflection operators (11). 


2. — The spin projection operator. 


It is convenient, temporarily, to express the subsidiary conditions (1.3), c, d) 
or (1.35, c, d*) by the symbolic notation 


(2.1) Hg 0, best 2,09% 

(2) The equivalence of Eqs. (1.2) and Eq. (5) may be understood by noting that 
the former equates to zero all the tensors of lower rank that can be formed from 9,,,,,, . 
Thus, g/"p,,,,, is a scalar, p#p,,, 18 a vector, and the antisymmetric part of p,,,, 18 a 
« six-vector », which transforms by the sum of two irreducible representations of spin 1. 
In Eqs. (3) below the components of y,, ... are accounted for as follows: the 4° com- 


s+3 j 
ponents are reduced to | 2 | by the symmetry condition. Subtracting the numbers 


2 De 
E | and (;) of conditions imposed by the other two subsidiary conditions leaves 


2s+1 independent components, as is appropriate for the spin 2 field. 
(5) Notation: p = y,p", where y, are the four-by-four Dirac matrices, defined by 


the commutation relations 
Yury «E ya = 29 ww . 
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Similarly, for (1.3a) or (1.3a*) 
(2.2) ng = 0. 

Introduce an orthogonal Do operator: 
(2.3) 0=0=0:, GO=y»,6',, 


with the following properties: (i) ifm, is a tensor of rank s (or a spin-tensor 
of rank s —4), then (09), = y, transforms according to D(s) (i.e., 7p = 0), 
(ii) if a specific (spinor-) tensor g,. transforms according to D(s), then 
Pa. =(09),.., and (iii) if g,.. satisfies Eq. (2), so does (Oy), : Symbolically, 
omitting indices: 


ni( Op) =0, (i), 

(ng = 0) > (vy = 99), (Condition of uniqueness) (ii), 

(np =0) > (709 = 0), (Commutativity with n) (iii). 
It will now be shown that the only non-trivial solution of (3) and (i) satisfies (ii) 
and (iii). 


Uniqueness of O. In view of the complete symmetry of (0g),. in all 
indices, © must be constructed from the metric tensor, the vector p, and 
(for half-odd-integer spin) y,,: 


(2.4) On = > {ag Dre dc bi ed pv EE 


where the sum is over all permutations of the lower indices. The defining 
equations (i) may be written 


(2.54) SL tn 

(2.5b) PO = 0, 

(2.5c) JO ARR US 2s even, 
(2.54) yh < a0. 2s odd. 


Combination of Eqs. (4) and (5) gives 


Vi Che — Ai goes 
oe Giri au > ou = an! di à 
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Provided © is not identically zero, Eq. (3) yields 
an! =1. 


If g,.. satisfies all the subsidiary conditions, Eq. (4) gives 
TE = à Dos = Prius ? 
p 


which proves that (ii) is a consequence of (3) and (i) unless @ is identically 
zero. The uniqueness of @ follows. 


Commutativity of O with n. The consistency of Eqs. (1) with Eq. (2) re- 
quires that 


(2.6) [ms np=ny =0. 


If this condition is a consequence of Eq. (1) alone (14), the set (1) may be said 
to be complete with respect to 7. If it is not, simply include the new equa- 
tions (6) in the set (1), and repeat the process untill a complete set of sub- 
sidiary conditions is obtained. No generality is lost therefore, by assuming 
that the original set was complete. This is actually true of the set (1.35, c, d) 
or (1.35, c, d*). 

The commutativity of O with 7 is an immediate consequence of the comple- 
teness of the set of subsidiary conditions. The latter may be expressed by 
the equation 

[m:, 7]O=0, 
or 
LO, n] =—nnO=0. 


Because of (ii), this means that 

0[0, nl =[0,n]. 
Since 0 and either 7 or iy are self-adjoint 

[9, nl’ = + (6, n]. 
Applying the last two equations 


[0, n]= OL9, n1=# 019, nj =F OLOL4, nl} = F 0[6, 1l O= 00, 7]O= 0. 


(15) LE for example Pp = VED > M2 = PP» 7 = p+im, then [mi ’ n |p = np a 2pl@,= 
= 229 = 0. 
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This completes the proof that (iii) as well as (ii) are consequences of (i) and 
of Eq. (3). For integral spin (iii) is trivial; for half-odd integral spin it means 
that O commutes with p. 

The actual determination of @ for arbitrary spin has been carried out in 
Ref. (5). The results are, for integral spin s 


(2.7) Gli — a) 5 i O + a00, 9%" 110% +... + 


P(x) | 1=1 i=3 
P(p) A gill 
CEN Ara," Ps. Oxa OF SE TER 
Og yf: Ole ON O 
where 
a, = (—}4)’s! {r! ( 8 — 2r)! (2s — 1)(2s — 3)... (28 — 2r +1)}> 


Opulent nla 
For half-odd-integral spin s = n 4 
28 cb ni 
4(s +1) 


This last formula provides a very simple proof of the commutativity of © 
with p 


10.0) Ofe-bn(s) = yy eee (n +1). 


2s+1 = By Bn 
Pag 2 x È ay By errs 
COR) PI = 96 4-4) VP PV) Lim +1) = 


One of the main uses of the spin projection is through the formulae 


2) Dop, po = At Oh, 


for half-odd integer spin, and 
(2.10) Di Pan Pe = ORS, 


for integer spin, where the sums are over the positive energy solutions of the 
wave equation and the subsidiary conditions. Equation (2.9) has been used 
in Sect. 10 and in reference (°). è 


3. — Electromagnetic interaction (1). 


The first attempt to introduce an interaction between the higher spin field 
and the Maxwell field was made by Drrac (17), who started from Eq. (1.3) 


(5) R. E. BEHRENDS and C. FRONSDAL: Phys. Rev., 106, 345 (1957). 

(5) Only half-odd integer spins are considered in Sect. 3-6. Summary results for 
integer spin are given in Sect. 7. 

(7) P. A. M. Dirac: Proc. Roy. Soc., A 155, 447 (1936). 
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and carried out the substitution 


(3.1) Di Dek 


ee 


Although this substitution represents the only known method of introducing 
a gauge-invariant direct interaction between A , and the particle field, it does 
not uniquely determine the form of the interaction. Indeed, different theories 
are obtained by starting from different, though equivalent, formulations of 
the free field theory. 

It was shown by FIERZ and PAULI (4) that Dirac's method leads to in- 
consistent equations for the interaction of g with an arbitrary external electro- 
magnetic field. An alternative method was proposed, that avoids the in- 
consistency of the Dirac theory, without completely removing the ambiguity. 

Although it is not certain that the Fierz-Pauli theory is self-consistent 
(in particular difficulties seem to present themselves with regard to quan- 
tization (18)), it has been rather widely accepted. It is, therefore, of some 
interest to exhibit, in a simple way, the intimate relationship between the 
various formulations of the Fierz-Pauli theory, as well as the importance of 
the spin projection operator in this connection. 

The theory of Fierz and Pauli is based on the requirement that all the 
field equations be derivable from a single Lagrangian variational principle (1°). 
While this requirement is satisfied by any formulation of the free field theory, 
the same may not hold after the substitution (1) has been carried out. The 
method therefore consists of determining a Lagrangian for the free fields, and 
then carrying out the substitution in the Lagrangian. 

Application of Eq. (2.5) shows that the wave equation and subsidiary 
condition (1.3) may be deduced very simply from the following equation 


(3.2) (Op +im)p = 0. 


In the momentum representation p, is a c-number, so that Eq. (4.2) may be 
obtained from the Lagrangian | 


(3.3) £ = (Op + img 


by variation with respect to g However, since Op involves inverse powers 
of p?, it does not seem possible to treat with conventional methods the theory 


(18) S. KusaKka and J. W. WEINBERG: University of California Doctoral Dissertation 
of J. W. WEINBERG (Berkeley, 1940). 

(19) That is, all equations which are postulated «a priori » (i.e., before the variation 
of the Lagrangian is carried out.) « A priori» conditions on the wave function are 
permissible provided there are properly taken into account under the variation: the 
variations dp, ... are not all independent. This was overlooked in the theory of Mol- 
dauer and Case. 
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which would result from applying (1) to Eq. (2). Indeed, it is a practical 
necessity that the Lagrangian be linear in p,. 

The remainder of this section deals with the problem of replacing (3) by 
a Lagrangian which is linear in p,, with the objective of relating to each 
other the various existing formulations of the Fierz-Pauli theory. The dis- 
cussion is limited to the case of spin 3, since much of the litterature treats this 
case exclusively. In the following section a more general approach is taken. 


The case of spin 3. Eq. (2) may be reduced to the form of a first-order 
differential equation. The corresponding first order Lagrangian is equivalent 
to (3) in the sense that variation leads to the same equations (wave equation 
as well as subsidiary conditions) for g,. After the substitution (1) has been 
carried out, this will no longer be true. 

Eq. (2) may be written, omitting indices 


(3.4) —imp=(1-tyy)pOg', 
where gy’ is the projection of g which satisfies y:pg=0 : 


(3.5) gp’ =(1—tyy)p.- 


The explicit expression for © is 


1 1 
3.6 OS] Sey 
(3.6) 1 37 3p° (pyp + PYP) - 


When Eq. (6) is inserted into Eq. (4), the result may be written 


(3.7) — imp = (1—+yyXpy'— py), 
where 
(3.8) î 

i y= 3p? PP’? 


is a four-component spinor field. 


Multiplying Eq. (8) by — im, and substituting for —img from Eq. (7), 
there results 


(3.9) —imy =—tpy+ hp’. 


Eqs. (7), (9) are equivalent to the original set (1.3), and are seen to be iden- 


tical to the equations of Fierz and Pauli, in the form obtained in the Ap- 
pendix. 
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Later formulations of the Fierz-Pauli theory do not require the use of the 
auxiliary field y. In the following it will be seen how y can be eliminated, and 
a general theory obtained that includes, as special cases, the theories of Rarita 
and Schwinger (5), Harish-Chandra (2°) and Moldauer and Case (7). 

Equation (7) is equivalent to 


(3.10) — imp = (1—t vy)(py'— py) , 
(3.11) ate) 


The vanishing of p-g' and hence of y does not depend on Eq. (11), but may 
be inferred from Eqs. (9), (10). Since the latter do not involve the projection 


g= +7) 9; 

which is orthogonal to g’, it is possible to identify the field y (which vanishes 
by virtue of Eqs. (9), (10)), with the field y-g = y-p" (which vanishes by virtue 
of Eq. (11)). Thus the auxiliary field y may be eliminated by writing 
(3.12) yp=ay?7, a# 0. 


This makes Eq. (11) superfluous, while Eqs. (9), (10) become, 
respectively 


ace 1 Te | 
(3.13) ee UP pe ec PES PEN 
(0238) — imp'= (1 —+yy}{pp— apy") - 


Eqs. (13), (14) may be replaced by any linear combination of them 
(3.15) k(3.13) + (3.14), k#0, 
as is seen by multiplying first by ¢yy and then by (1—17y) 
+ yy(3.15) = K(3.13) , 
(1—+yy)(3.15) = (3.14). 
(2°) HARSH-CHANDRA: Phys. Rev., 71, 793 (1947). The theories of References (5) 
and (7), as well as the part of Reference (?°) which is of interest in the present con- 


nection, are all equivalent to the Fierz-Pauli theory. The latter, however, suffers from 
a cumbersome formulation (see Appendix). 
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When Eq. (15) is written out in full, and multiplied by y, the Lagrangian 


1 
CLONE A a — ky) +p=(5+ a) py + 


| LR IR Le Ea | 
TC: 8a) ?P?{? 
is obtained. 


In order that the equation derived from (16) by variation of g be the adjoint 
of Eq. (15), the Lagrangian must be Hermitian. This imposes the condition 


(3.17) k = — 12aa* 0, 


and (16) reduces to, with A=—a—-3#—}, 
(3.18) 2 = G{(p + im) + Apy + A*yp + 4(8AA* + À + A* + 1)ypy — 
| —im(3AA* + 3A _ 3 À * + 1)yy} x 


This is a two-parameter family of Lagrangians, and reduces to a one-para- 
meter family which is identical to that obtained by MOLDAUER and CASE (7), 
when A is taken to be real. The Rarita-Schwinger (5) theory is obtained by 
the special choice À —— +. From the theory of Harish-Chandra (?°), which 
describes particles of mixed spins, 3 and 3, PETRAS (?!) has extracted the 
spin 3 part. Petras’ Lagrangian, which is not Hermitian, corresponds to the 
special choice k=1, a=—} EVA 

The complete generality of (18) is not guaranted by the above analysis, 
but is proved rigorously in Sect. 5. The proof rests on some general theorems 
concerning first order wave equations for arbitrary spin; these are obtained in 
Sect. 4 for half-odd integer spin, and in Sect. 7 for integer spin. 


4. — The minimal condition. 


The most general Euler-Lagrange equation which is obtained from a Her- 
mitian; Lorentz invariant Lagrangian, and which is of the first order in Ps 
and linear in y, may be written 


(4.1) (p'a, + imP)p = 0 ’ CM È x, ? Be 3 p ’ 


where the matrices x, and B are form-invariant, numerical matrices (74); 


(21) M. Perras: Czech. Journ. Phys., 5, 2 (1955). 
(??) Note that in the spin } case, Lu = Vus B= 1. 
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The wave function y transforms according to some reducible represent- 
ation D, of the Lorentz group. The transformations consist of an «orbital 
part» and a «spin part», and the two parts commute. Therefore the spin 
matrices are, by themselves, a representation of the Lorentz group; DL) 
say. This may be written as a direct sum of irreducible representations D(i, L), 
with i=}, 3,..., s, (we are dealing in this section with half-odd-integer spin 
only) 


(4.2) D(L) = D(s, L)® D(s—1, L)@...® D(s—1, DO Dis — 2, LO... 
= D(s, L)® > n,D(i, L), 


The spin value s appears once, while the number n; is the multiplicity of the 
spin value è. 

Commutation relations for the matrices «, and f will now be derived from 
the following requirements. First, that Eq. (1) be equivalent to the following 
two equations 


(4.3) (1—O)p =0, 
(4.4) (p + im) Op =0. 


Here Og is the part of g which transforms according to the first term of 
Eq. (2). Thus Eq. (3) is the subsidiary condition which requires that the 
spin be unique, and Eq. (4) is the wave-equation which we impose on 0g only. 
Notice that the representation is not necessarily in terms of Dirac spinors, 
so that Eq. (4) means that there is a wave equation which would take that 
form if such a representation were used. 

Eq. (4) is the condition that the mass be unique, but reflects also the choice 
of reflection operators. Specifically, it is invariant under the substitution 


(4.5) Opps, P) > ya OP(Pas — P) - 


The reason for demanding this invariance is simply the success of the Dirac 
equation for spin }. As long as one is discussing free fields only, there is no 
reason to extend invariance under (5) to hold for the entire wave-function. 
However, when the electromagnetic interaction is introduced, Eq. (3) will no 
longer hold. Analogy with the spin —4 case therefore suggests that Eq. (1) 
be invariant under a transformation of the form 


(4.6) (Pas P) > YaP(Pas — P) - 


The choice of y, in Eq. (6) is not imperative, but any other choice consistent 
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with Eq. (5) is easily seen to be equivalent for our purpose. An alternative 
way of stating the invariance under (6) is to require that, if a representation 
in terms of 4-spinors is used, the x, and be expressible in terms of y,-matrices 
and the metric tensor (i.e. no o-matrices or y, will appear). 

Thus, the requirements from which commutation relations for «, and p 
will be derived are the equivalence of Eq. (1) to the set (3), (4), plus the 
invariance of Eq. (1) under (6). 

A component of y may be labelled by the number i, referring to the re- 
presentation D(i, L) (see Eq. (2)), a number t,, (1<t<n;) which refers to 
one of the n; representations D(i, L), a number s; taking on 2% +1 values 
and a sign (the sign of the energy). The two latter refer to the 2(2i+1) 
components of D(i, L). Hence 


(4.7) y =|t, 4, 8, E>, E= +. 


It is clear that, by definition 
(4.8) Ke, 855 4, t,| O Le d', 8; e) = ds 051594, Days: de . 


Consider the sub-group L° of L under which a given momentum vector p° 
is invariant. This was called the little group by WIGNER (?°), who proved 
that D°(L°) may be written as the following direct sum of irreducible repre- 
sentations 


8-1 
(4.9) D$(L°) = D+(s, L)® D(s, L)® Yan, [DHi, L)@ Di, Le)]. 
i=} 
Form-invariance of «, and 6 under L, means that «,p°” and 6 commute with 
DY(L°). By Schur’s lemma: 


(4.10) <e, 8,, it, |x, p|t,, (8,2) = dis ss: CE Sig it, | ope” |t;, 4, 8, 8") 


and similarly for 6. Invariance under (6) gives (as is easily seen in the rest- 
system of pl) 


<& S;) i, t,|a,p®"|t,, i, 8) €) = (P°)ee Ut) ’ 
CEy 8,54, t, |B \t,, à, Si) e > = Os b(i), È 


Formally 
s—1 

(4.11) a,p" = p @[a(s)® > a(i),«], 
i=} 


(4.12) B =[B(s)® ol, 
t=} 


a 
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where a(t), ,, and b(i),,, are numerical matrices diagonal in the dimension 1. 

Here the superscript «» on p” has been dropped, since Eqs. (11) and (12) 
are explicitely covariant. 

Comparison of Eqs. (8), (11) and (12) gives immediately (since a(s) and 


b(s) are one-dimensional) 


(4.13) ap" O = 0a, p" = pOa(s), 
(4.14) PO = Of = OWs) . 


Hence Eq. (11) may be written as two separate equations 
(4.15) [pa(s) + im b(s)]Op = 0, 
(4.16) [x,p" + imB\(1—O)p = 0. 

Up to this point the form-invariance of «, and # plus invariance of Eq. (1) 
under (6) have been exploited. There remains the requirement that Eq. (1) 
be equivalent to Eqs. (3) and (4). In view of the above this means, first, that 
(4.17) a(s) = b(s) = 1, 
and, second, that Eq. (16) reduce to Eq. (3), i.e. that the secular determinant 
(4.18) Det [(,p" + imB)(1 — @)] 
must be non-zero for all values of m. It is seen that this is impossible for 
m= 0, so that this case must be excepted. Next, 6 must be non-singular, 


or more precisely, 6g must have the same number of components as g. Then 
there exists a matrix defined by 


B Bp =@. 
The determinant (4.18) may then be replaced by 
(4.19) | Det [(I,p"+im)(1— 0)], T= Ba, 
According to Eqs. (11) and (12), 
(4.20) r,p=p®l1® 2 Cae Cli) =>) ali), 
and (4.18) is the product of the determinants 


(4.21) Det [p © C(i), + imo, 41, DS, 1e — ly sot 


dat ur. PRE NN TE PRIE TONNERRE 
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These determinants are non-zero only if they are all of the form 


ny 
. 


(im) 
But every matrix satisfies its characteristic equation, hence 
(4.22) fea) —'0. 


This is the main result. It may be written in a number of ways, the most 
striking being Eq. (27) below. Conditions of this form are referred to as 
minimal conditions. 

It has been proved that the necessary and sufficient conditions that Eq. (1), 


with form invariant «, and f, be equivalent to Eqs. (3) and (4), and be inva- 
riant under (6), are that the matrices satisfy the conditions 


(4.23) i De Q = [e PRE Ti = pa , 
(4.24) pO = 08 =O, 
s-1 
(4.25) Tp" = p ®[1@ > Ci). , 
i=} 
s-1 
(4.26) p =1@ >, b(i), 4; ’ 
(4.27) (Nr) 0 ={(p)} 6, 
where 
(4.28) ‘hs a pra, bj 


and 7” is the largest of the numbers n,: Eqs. (23) and (24) are simple resta- 
tements of Eqs. (13), (14) and (17). Eq. (27) is completely equivalent to 
Eq. (22). 


Note. — If the wave-function is a tensor spinor of rank s—4, and sym- 
metric in all the tensor indices, #=n,=s+4, so that 


ER Con Vi SE (p)*** O . 


This can be realized only if s =} or 3, since only then is the right-hand side 
a polynomial in p,. This proves a statement by KUSAKA and WEINBERG pee 
that a symmetric tensor-spinor of rank 2 is insufficient to describe the spin 
3 field by a first order wave equation. If, however, a general (i.e., not 
Symmetric) spinor-tensor is used, % will always satisfy the minimum requi- 
rement of making the right-hand side of Eq. (27) a polynomial in Py 
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5. — Wawe equations for spin 3 


In Sect. 3 it was seen that the spinor-tensor 7, is adequate for the con- 
struction of a first-order wave equation for spin 3, and a Lagrangian (Eq. (3.18)) 
was constructed. With the aid of the results of the preceeding section, the 
complete generality of that Lagrangian will be demonstrated. 

The reduction of y, according to irreducible representations of the Lorentz 
group may be carried out as follows 


Qta 


with 
A'p, = V(1/12)(y"— 4p"p|p)9, , 
B'g,, 7 379, p 


Here Op is the spin 3 part, having 8 independent components, and the other 
two parts have each the four components appropriate to spin + fields. Further- 
more 


Hc Be D De Où 
TATA = B - B = 1 . 


In this representation the explicit form of J’,p*, as given by (4.25) is 
(5.1) (pò) = Op + AC pA + AC:pB + BC,,pA + BC»pB. 
It is now a simple matter to impose the requirements 1), that /7,p? be of the 
first order in p,, and 2), that the square of the matrix C,, be zero, as required 
by Eq. (4.27). Next the most general form of the matrix f is written down 
(5.2) pi=>. 14 constiyy . 
It is immediately found that the Lagrangian 

2 = p{BI,p+imB}9, 


with /’,p, and B given by (5.1) and (5.2), is identical to the one found in 
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Sect. 3 (Eq. 3-16). This constitutes a rigorous proof that the latter is indeed 
the most general Lagrangian which is linear in p,, and from which all the 
field equations may be derived by a single variation, when the wave equation 
for the spin 3 field is taken to be the tensor-spinor y,. In particular (3.18) 
is the most general hermitian Lagrangian, under these conditions. 


6. — Algebra of the /-matrices. 


Commutation-relations may be derived for the matrices /",= ‘x, from 
the minimal condition (4.27) and from (4.23), (4.24). In terms of bie 


LE. = Pp" O ; 


(6.1) 

(T,p")O = 0 (I,p") = po, 
or 
(6.2) (C,p’— PII,P"Y = 


In the rest system 


(6.3) (Ty — Va) a — 


In these equations » is the largest number of fields of given spin that appear 
in the wave-function y. The esixtence of a relation of the form of Eq. (3) 
was derived by KUSAKA and WEINBERG (18), who did not give the present 
definition of the number 7. Neither was Eq. (1) given by these authors. The 
weaker condition 


(6.4) L (DELI =0, 


has been given by HARISH-CHANDRA (°°) and by UMEZAWA and VISCONTI (22). 
Although the methods of these authors differ somewhat from each other, the 
basis of the argument is in each case the requirement that every component 
of the wave-function satisfy the Klein-Gordon equation. In the present ana- 
lysis Eq. (4), as well as the stronger condition (2) have been derived from the 
requirements that the solutions of Eq. (1) describe particles of unique spin, 
and that a unitary parity operator exist. 

In the rest system 0 = I", by Eq. (1). Hence 77" is an idempotent. This 
can also be seen by iteration of Eq. (4). 

Since the p , are arbitrary, conditions more general than Eq. (3) may be 
deduced from Eq. (2) 


(6.5) LUO > val uly I 


P(u) Ply) 
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and the weaker conditions 


(6.6) eo inl, = See, (IUT, 


P(u) Ly P( py) 


where the sums are over all permutations of the indices. Special examples 
of Eqs. (5), (6) are needed for the calculation in Sect. 8 of magnetic moments 


(6.7) PL, PLS =0, 

(6.8) POP ADTD)IA LOT —402) FP dg; 
i=0 i=0 

(6.9) I, PA FR ne P, ? 


where ?, j — 1, 2, 3 and 


(6.10) PELI FP ee TS): 


7. — Integer spin fields. 


In the case of integer spin, not first-order but second order wave equations 
are considered 


(7.1) (x,,P"D" + m>B)p = 0. 


In the simplest non-trivial case, that of spin 2, it proves sufficient to work 
with a symmetric, traceless second rank tensor g,,, and one auxiliary scalar 
field y. Starting from the wave equation (in terms of an arbitrary 2-nd rank 
tensor) 


(Op? + mp = 0, 
a field g' that satisfies the algebraic subsidiary conditions, is introduced 
(7.2) Pur i [4(0%, 62 + dl 6) DC 16,0 2lpx i 


The terms in ©p? containing powers of 1/p? are absorbed into a new scalar 
field y, which is subsequently identified with y: There result two equations 
that may be added in analogy with Eqs. (3.13), (3.14), to give a 2-parameter 
family of Lagrangians. | 

Considerations very analogous to those of Sect. 4 may be applied to the 
integer spin case. The main results of Sect. 4 are contained in Eqs. (4.23-27), 


28 - Supplemento al Nuovo Cimento. 


434 C. FRONSDAL 


and have the following analogues, respectively 


(7.3) (p°x-p)O = O(p'a-p) = pO, 
(7.4) pO = OB =O, 

(7.5) p-I-p =PL1® > Ci] 
(7.6) 8=01© Sol, 
(7.7) (pl EEE DE), 

where 

(7.8) PSs rx, 


and B-1 is defined by 
BB — 9. 

These results may be applied to the derivation of the most general wave 
equation for spin 2, in terms of the wave function g,,. For a traceless, sym- 
metric p,,, the dimensions of the C(i) matrices are nn =%»=1. Thus # — 1, 
and Eq. (7) cannot be satisfied. Relaxing the trace condition on g,,, there 
results n,=1, n=#=2. Alternatively, the symmetry condition may be 
given up, so that n,=n=3, n= 1. 

In the general case of integer spin s, the wave function may be taken to 
be a tensor of rank s. A completely symmetric tensor cannot be used, since 
then n = nm) = 34(s+2) or 3(s+1), according to whether s is even or odd, res- 
pectively. But this does not make the right hand side of Eq. (5) a polynomial, 


except when s= 2. If, instead g,,,.. is completely traceless, n,_; = ( 
in general is much larger than what is required by Eq. (5). . 
From Eqs. (3), (4), (5) commutation relations may be derived in the form 


> Ts Pl Ig) 0 


P(u) 


} which 


8. — Magnetic moment of fermions. 


The magnetic moment of particles defined by an equation of the form 
(8.1) CIE + imlp = 0, IT" = p"—ieA", 


was considered by HARISH-CHANDRA (2%). HARISH-CHANDRA finds, in the non- 


(**) H. UMEZAWA and A. VISCONTI, see H. UMEZAWA: Quantum Field Theory (Am- 
sterdam 1956). 


(24) HARISH-CHANDRA: Proc. Roy. Soc., A 195, 195 (1948). 


ON THE THEORY OF HIGHER SPIN FIELDS 435 


relativistic approximation, that Eq. (1) reduces to 


eee Si 
(8.2) {ur + im) — 5 IPP, T,(P_+2.5 rer} Pio, 


where P, were defined by Eq. (6.10) and 
12, = il ES TRIS . 


This result relies on Eq. (6.4) only, and the number # remains unspecified 
in Harish-Chandra’s theory. 
Noting that the form invariance of I°, requires that 


(3.3) D = 181 — Sali, 


where $,, are the spin transformation matrices, there follows by a generalization 
of a technique due to Petras 


n-1 n—1 
(8.4) PET: ( Pap 2 > Tui Pod LPs = 3 PDS — Sails 3 2 TéPo) TPeik 


i=0 


n=1 
TP al (Pl 2 > Pia Sul) Pr = Pa (Ly Ly Sale 


1=0 


When this result is introduced into Eq. (2), that equation becomes 


) 1 Ù tj | 


where the magnetic moment M;; is given by 


é 
dm P4(84T;— Syl) Ps 


(8.5) Mi 

Any relativistic wave function may be considered as being composed of 
two parts as follows. One part contains the spin s field, and is a tensor-spinor 
of rank n=s — 4. This part of the complete wave function g (written without 
indices) shall be written g,...,, (with indices). The other part of g does not 
contain a spin s field, and must therefore vanish as a consequence of the field 
equations. This part will be written y. The projection (0g) is the set of wave 
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functions which satisfy the following conditions 


(i) php. 
(ii) VR Pr cca 
(ili) P.. MATE Vania? 
(iv) p=0. 


Let an infinitesimal Lorentz transformation be carried out 


PA DE 
(8.6) y 
(D) > p(p') + doi: q(p) . 


When these substitutions are made in Eqs. (i)-(iv), there follows that 
(nrg = 0) > (Mr Sup = 0). 


Here n,g — 0, r=1, 2,3 stands for the three conditions (ii), (iii) and (iv). 
By the definition of © this may be written 


Nr Sai O "= 0 ’ 
or, in the rest system 


(8.7) Veage 4 = 0 . 


With respect to (iv), this means that the part y of the wave function may 
be ignored in the calculation of the magnetic moment. Then the /,-matrices 
must be expressed in terms of the invariant matrices g,,, y,, and ¢,,,,. When 
an expression of this form is introduced into (5), it is immediately recognized 
that, because of Eq. (7), the only surviving term is that which is diagonal 
in all tensor indices. By Eq. (6.7), this term is equal to y,. Hence 


(8.8) Mi = (e/2m) P (Sy; a Sayi) P+ . 
The explicit form of S,,; is (ignoring the part which acts on Y) 


(Sa)! DI Ss =ipyidh... + > DE (dus, Om ad 0, 0 Aged Lo 


When this is inserted into (8), all but the first term is annihilated by the 
projection operator P,, and there remains 


M, = (e/2m)P,y;y;P;. 


"A 
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By means of some simple algebra 
(Pv (Py = 
= (Pl (5g: — 4(1 — 1/28 yp, yp") yi ys (05 — $ (1 — 1/28) 5,7") (Pa). = 


sc 1 1 di air I 
tp) 3 [yiy: 05 … — > Op: ... (pn 0 — 046,03) (PRE. = 3 Babi. 


m 


Hence the eigenvalues of the magnetic moment are given by 


e 48 
(8.10) iii da 
2m S 


where <S,) are the eigenvalues of the k-component of the spin, and è, j, k=1, 2,3 
cyclically. 

This formula has been given by MoLDAUER and CASE (7), but derived 
rigorously only for spin 3 (see footnote (!°)). PETRAS (21) gave the correct 
value for the magnetic moment of spin 3 particles, although the Lagrangian 
underlying Petras theory is not Hermitian. Recently evidence concerning the 
spin of the muon has been obtained, which is partly based on Eq. (10). 
GARWIN, LEDERMAN and WEINRICH (2°) have found that the gyromagnetic 
ratio of the muon is 2.00. The gyromagnetic ratio of particles obeying the 
Fierz-Pauli equation (1) is given by Eq. (10) to be 1/s. Hence, if the muon 
is a Fierz-Pauli particle, its spin must be 3. 


9. — Polarization operators. 


In addition to the spin projection operators @ introduced in Sect. 2, 
projection operators which select certain values of the z-component of the 
spin are needed. (For convenience polarization is always referred to the 
z-direction.) These will be referred to as polarization operators. The simplest 
example is encountered in the case of spin 4. Polarization operators for this 
case have been used by MICHEL and WIGHTMAN (25), and BOUCHIAT and 
MICHEL (2) Defining 


(9.1) Pls) =: (Si +8), 


where S, is the spin operator with eigenvalues +3, it follows that 
P(s)y(s') a 6,..p(8') ’ 
(25) L. MicHEL and A. S. WIGHTMAN: Phys. Rev., 98, 1190 (1955). 


(26) C. BoucHiaTt and L. MicHEL: Compt. Rend. Acad. Sci., 243, 642 (1956). See 
also C. FRoNSDAL and H. ÜBERALL, Phys. Rev., to appear. 
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where y describes a pure spin state. The spin operator S, may be taken to 
be either 


7 I uv " a 
Sy= di Sy, OP Sy= 9 V:8°Y n » 
where, if the momentum is directed along the z-axis 
se har — TE st = {o, EE 


It is easy to see that 
S,(p — im) = S,(p — im) 


so that the two operators are the same when applied to solutions of the Dirac 
equation. Previous authors have used the operator Si here the alternative 
S, will be found advantageous. (The prime will be dropped henceforth.) 

The generalization of (1) to the case of a total spin of n+4 with 2-com- 
ponent s is 


(9.2) P,(s) = d(n, 8) IT (8,1; —8); 


Pet 


where s’ runs over the eigenvalues of the +-component of the spin, and 


4 ) di n, : 
(9.3) (Snsglaran = Bd eee dus + D OM ue Sim ue Oar 


m=1 


The normalization factor may be found by noting that 


Sn+3Pn(8) = sP,(8) . 


Hence 
IE AAMC ce 
and 
(9.4) P,(8) = UI (&—s')}? II (Su —s). 


The polarization operator commutes with the spin operator, as is seen by 
verifying that 


p™ (8,40): 


1 


7" (8,30) = 0. 


es 
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Examples of (4) that will be used in the following section are, for spin 3 
and 3, respectively 


(9.5) P,(s) = {28(s? — 9/165?) -1(S, + s)(S; — 9/165?) , 
(9.6) P,(s) = {28(2st— 3582/4 + 225/648?) }-1(8, + 8)- 
*[S + (8? — 35/4) 85 + 225/6452]. 


When only the magnitude, but not the sign of the spin component is relevant, 
the appropriate projection operators are 


P.(131) > P.(a) Bs) ) ={T] (8— s9)}- IT Sig 8). 


STATS] [8'T1#151 


10. - Angular distributions. 


Calculations on angular distributions of the decay products in interactions 
involving particles of higher spin will be limited to one particular case: the 
decay of hyperons through the scheme 


KHK>NnN+r, 


where # is a hyperon and © is a nucleon. Of all the elementary particles 
which are known or suspected to exist, the hyperons are most likely to have 
higher spin (1-2). 

The angular distributions of the hyperon decay products have been cal- 
culated by ADAIR (27) and TREIMAN (28). The present calculations differ in 
two respects. First, ADAIR and TREIMAN assumed that parity is conserved 
in the decay. This assumption has been found to be inconsistent with observed 
facts, in recently performed experiments (8°), and will not be made here. 
Second, earlier calculations have been carried out by means of Clebsch-Gordon 
coefficients, while the present method employs the spin projection operators 
and the polarization operators. Because the parity non-conserving interaction 
has a parity conserving part, a partial comparison with the results of ADAIR 
and TREIMAN can be made. 


(27) R. K. ADAIR: Phys. Rev., 100, 1540 (1955). 

(28) S. B. TREIMAN: Phys. Rev., 101, 1216 (1956). 

(29) C. S. Wu, E. AMBLER, R. W. Haywarp, D. D. Hoppss and R. P. Hupson: 
Phys. Rev., 105, 1413 (1957); R. L. GARWIN, L. M. LEeDERMAN and M. WEINRICH: 
Phys. Rev., 105, 1415 (1957). 

(30) Venice Conference, 1957. 


440 C. FRONSDAL 


The most general direct interaction is given by the following matrix ele- 
ment (no attention is paid to constant factors, since the interest is in the 
distributions) 


K = Po(p2)(1 + 75) De --- Po Pn(s )Y1(P1)®(P1 — Pa) ; 


where y,, y, and g are the wave functions of the hyperon (spin n++#, 2-com- 
ponent s), the nucleon and the pion, respectively. Non conservation of parity 
is caused by interference of the two terms in the factor (1+ay;). The tran- 
sition probability, summed over the spins of the two fermions, is calculated 
by means of Eq. (2.9), and is found to be given by 


| K |? = Tr {ps «++ Pa Pn(8) O(p1) po «+ P2( pi — im,)(1 — a*y5)(ps — im:)(1 + ays)}. 
Using the subsidiary conditions, this can be reduced to 


1+ aa* ead a La* 
VI es MS ee Ta eee er, 


where 
T, = Tr{p,... Ps [] (82,4—8") O(D:)Pa ve Pa}; 


SAS {re n Sii — 8°) O(pi)Ps … Pepspays} | 
iL 

The first trace, 7,, is the parity conserving part of the angular distribution. 
The second trace depends on the relative signs of s and the 2-component of 
the nucleon momentum. This correlation between the directions of an axial 
vector and a polar vector is characteristic of a transition in which parity is 
not conserved. 

The quantities 7, and 7, can be calculated by making use of the explicit 
form for the spin projection operators given by (2.7) and (2.8). The results 
are as follows: In the rest system of the hyperon Eq. (1) reduces to 


LR 1 — aa* 
Da "Pasi ele ot: Ts + —— 


cate DIF 


(10.2) [KP n [Ba | 00s OT, 


where # is the angle between the hyperon spin and the nucleon momentum. 
For spin 3, 7, and 7, are equal to unity. 


For spin 3 


(10.3) my (7 al ver de 
tto \ 
= ila 30080 a 


J 
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For spin $ the results are 


Ur [17 97 225 x 
T,= =) A BE 3? 1 | DS (882 + 10) cos? 3 + 30 cost ‘à ‘ 
Aled ul: {49 1 225 
T= ci a | (2 8° 2 + aor (248? + 150) cos? d + i) SEGRE 7 


If a hyperon is produced in a scalar interaction involving a nucleon and various 
particles of spin zero, the component of its spin in the direction of flight is +4. 
If the two components of the hyperon beam—that is the spin +4 component 
and the spin —+ component—decay incoherently and are of equal strength, 
or if the decay conserves parity (that is, if a is zero), the transition proba- 
bility is given by 7, alone. This case was considered by ADAIR (27) and 
TREIMAN (28), Setting s? =+ in Eqs. (3) and (4), the result is: 


for spin 3 
T,=1 +3 cos Ÿ 
and for spin 3 


T,~1—2 cos? Ÿ + 5 cost? . 


These angular distributions are the same as those found by ADAIR and TREI- 
MAN. The more complete results of this work are contained in Eqs. (2), (3) 
and (4). 
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APPENDIX 


The connection between the spinor representation used by FIERZ and 
PAULI, and the tensor representation employed in the presen paper, is illus- 
trated by transforming the spin $ wave equations from one representation to 
the other. Even though the spinor representation is the more complicated 
one, it is useful for gaining new insight into the meaning of the subsidiary 
conditions, and the nature of the spin projection operator. 
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The van der Waerden spinors that form a basis for a spin 3 representation 
of the proper elements of the Lorentz group are of rank 3. There are four 
kinds: 4,59 @4,,, dii, and aiié, The improper elements of the Lorentz group 
transform d,,¢ into a4#2, a4,, into a4%,, and vice versa. Thus, to represent 
the Lorentz group the set a,,,, 4424 or the set a4,,, aië, is needed. These 
representations are reducible. The subsidiary conditions satisfied by the ir- 
reducible spin 3 parts are 


(A-1) dipes 4422, completely symmetric , 
(A-2) aA, = ali ’ ai 5, == abi, ’ 
(A-3) Pi di so =0, piaib, =0, 
where 


Pai a Pu Osa ’ 


and co,= iI, I being the unit 2-by-2 matrix. 

The first set possesses an advantage with regard to simplicity. The spin 
projection operator is simply the complete symmetrizer, and the redundant com- 
ponents may be completely eliminated. This advantage is off-set, however, 
by the non-existence of a first order wave equation connecting a,,, and ads, 
For this reason DIRAC, and all subsequent authors, have considered the second 
set. The wave equation is 


f M4 go = Pzp 0440, 


| mais, = pra, k. 


(A-4) 


DIRAC suggested introducing the electromagnetic interaction by means 
of the substitution 


(A-5) Py > Pu teA, . 
However, as pointed out by FIERZ and PAULI, this leads to serious difficulties. 
They therefore introduced two auxiliary fields, C, and Ci, with the help of 


which the free field equations (2), (3) and (4) can be derived from a single 
Lagrangian principle. There also follows from this that 


(A-6) O,=04=0. 


The substitution (5) may now be carried out in the Lagrangian, or in the 
Euler-Lagrange equations. Of Eqs. (2), (3), (4), (6) only Eq. (2) remains valid 
in the presence of an interaction. 

The free field Euler-Lagrange equations are 


mali, = 4(540% + 6264)(pia*,, + pic), 
MA y = 3 (6105 + OF0%)(p p02, + pC) , 
mO,=— pu 054 spin, 


Lire MOTI Os: 
mC; =—4p420, + ipfaib,. 


(A-7) 
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Introduce tensor indices by writing 
aâë, = oA, ab ee 


Eq. (7) then assumes the form 


ma, = (Mz (D "49 + pC”) , 
Mn, = (Yala (Peed? + D, On) 5 

mC, a —4p,; 05 va 3p" Aya 3 
mOi = —3piC,+3ip"ai, 


where 

Le A 58 B si\ _vF v oA Glen 
(Nude = 204 23 (05 OF +0; 0) 0 o = 40,03 + 40,50%, 
à On O4 Cod 


Next introduce the Dirac 4-spinors. Suppressing spinor indices 
A Au a A iC, 
ma (ea) 9= (vi): 
and introducing the y-matrices 
1 Orns 
Eee pÀB 
Va a i | O fr | ’ 
the wave equations finally reduce to 


| — ime, = (di — tu") (P% — PW); 


(A-8) 4 3 
| —imy =—3py + 40", 


Equations (8) may also be written 


(A-9) (DT, + im)x = 0, 
where | 
Pi i — Pi + 4YP 
Pa n x i — Da + £Y2P 
x=\% 1, p'T, = wee tu )p | — p,+ }ysp 
Pi a — pit typ 
2 3D, SP 3D°, sp°: — bP 


Equation (9) is of the same form as that obtained by Gupra (°). Although x 
has 20 components, as compared with the 16 components in Gupta’s theory, 
a considerable simplification is achieved in writing the J’), matrices in terms 
of the y-matrices. In Gupta’s theory the four matrices must be given expliticly. 
It should be emphasized that the three formulations of the Fierz-Pauli theory 
are completely equivalent. Equation (9) shall not be considered further, since 
a yet simpler formulation (Eq. (3.18)) exists. 


an yy r, ” 14 Ps er 
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Introduction. 


Since the introduction of analytical methods in theoretical physics the 
application of the geometrical attitude, as well as that of the different methods 
of elementary and differential geometry in physics, is one of the most im- 
portant resources of theoretical description. Of course, this does not mean 
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only the use of geometrical methods to explain explicitly the background of 
motions of test bodies in space and time, but also the different applications 
of methods of elementary differential geometry as e.g. is well known in the 
case of kinematics. However, these do not yet mean the « geometrization of 
physies » according to Riemann’s concept elaborated clearly by EINSTEIN in 
the case of his general theory of relativity. 

In the topic of theory of physical fields, as matters stand, there seem to 
exist two very different ways for the applicability of geometrical methods. 
One of them refers to the pure description of changes of the field excited in 
the space-time world in accordance with previous use of the geometrical 
methods; the other is the direct geometrization of the physical field as ela- 
borated in the case of Einstein’s theory of gravitation. To distinguish between 
these field theories of essentially different kind, it seems to be desirable to 
use the expressions: field theory of first and of second kind, respectively, intro- 
duced previously [9]. As mentioned above, in the case of a field theory of 
second kind the geometrical background is the four-dimensional Euclidian 
(respectively the pseudo-Euclidian) space and the physical fields are described 
by potentials (or by other quantities of the field) which are ordinary space- 
time functions. If on the other hand, the physical properties of the field are, 
according to the idea of Riemann, characterized by the geometrical structure 
of the space, the theory of the field is called field theory of first kind. From 
this point of view Hinstein’s theory of gravitation is e.g. a field theory of first 
kind; the electromagnetic and mesonic theories, respectively, are field theories 
of second kind. 

Since in the case of the field theories of first kind it has become obvious 
that the geometrical structure of the four-dimensional Riemannian space is 
determined by the gravitational field alone, many investigators have carefully 
been studying the possibility of some unified theory of gravitation and electro- 
magnetism. As these very interesting investigations of the past three decades 
have shown, practically all generalizations of the four-dimensional Riemannian 
geometry are suitable to serve as a geometrical basis of such a theory [11, 37]. 
Consequently numerous theories have been put forward, so that the bare re- 
cording itself of the various proposals is a real problem, even without taking 
theories based on geometries with more then four dimensions into account. 
In spite of these extensive investigations, a generally accepted final unified 
theory of fields has not yet been found. 

In the course of the investigations in the topic of unified field theory of 
physical fields mentioned above, quite modern methods of differential geo- 
metry have been used which seem to be adequate for other applications, espe- 
cially, in the case of field theories of second kind too. Therefore, in the fol- 
lowing the usual problems of the unitied field theories, as well as the geome- 
trization of fields will not be dealt with, but quite another point of view will 
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be put forward. Namely, some kind of geometrization of a given field theory, 
or field theories respectively, in the Riemannian sense will not be searched 
for, but, some field theories of second kind in general metrical spaces will be ela- 
borated; i.e. we shall regard the metrical geometrical space in which the field 
is excited as a generalized one, the geometry of which is determined by an 
arbitrary metrical ground tensor. 

Such investigations were known previously in the case of relativistic 
electrodynamics in a gravitational field, when the relativistic covariant for- 
malism of the Maxwellian theory was elaborated in the Riemannian space. 
However, the aim of these investigations is not a pure mathematical gene- 
ralization of the theories of second kind, but they are supported also from the 
physical point of view as can be seen from the following. 

The metrical point geometry of spaces is the geometrical model for isotropic 
spaces. If a physical field is excited in such a space, the components of the 
field depend only on the co-ordinates of the space-time world, i.e. are ordinary 
point functions. The geometry of the space is pseudo-Euclidian, or Riemannian 
according to whether the medium which is represented by the geometrical 
space is homogeneous, or inhomogeneous, respectively. However, if the medium 
is anisotropic, the physical properties of the medium are quite different in the 
various directions, therefore the quantities of the field depend not only on the 
position co-ordinates of the space, but in every space-time point on the direction 
co-ordinates too. This means that we have for every direction another dif- 
ferent Riemannian space as geometrical basis for the field theory of second 
kind; 2.e. the suitable geometrical basis for the physical field theory is a special 
kind of superpositions of Riemannian geometries. This heuristic idea is realized 
by the geometry of such line-elements spaces as the Finslerian geometry [4, 9, 35] 
or the geometry elaborated previously [10]. Hence, it is obvious that our 
proposition is of some interest from the point of view of the electromagnetic 
field theory in anisotropic (and inhomogeneous) dielectrica. 

To a certain extent an analogous point of view was put forward previously 
by W. BLASCHKE [2], as well as by O. VARGA [36] in the case of geometrical 
optics in anisotropie media. 

As a quite different application of the idea put forward above the recogni- 
tion can be regarded that the general line-element geometry is a natural geo- 
metrical basis for the classical bilocal field theory [10]. 

Now, in this paper we shall first elaborate quite heuristically a new rela- 
tivistic theory of electromagnetic field in dielectrica, as well as a model example 
of a classical bilocal theory of fields for the illustration of our general point of 
view. Then will be given a short review of the general classical theory of 
physical fields of second kind in general spaces elaborated previously, as well 
as the differential laws of conservation in such a space. 
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I. — Relativistic theory of the electromagnetic field 
in uniformly moving dielectrica. 


1. — The physical basis of the new geometrical point of view. 


According to Einstein’s special theory of relativity it is impossible to fix 
an absolute frame of reference based on any physical event; that is, all inertial 
systems are completely equivalent. This means especially in the case of the 
velocity of light that in all inertial systems this velocity is on the one hand 
isotropic, on the other hand, a universal constant. However, in the case of 
propagation of light in dielectrica this is not valid, since the velocity of light 
in the presence of a dielectricum is already different from that in vacuum: c/n 
(where e is the velocity of light in vacuo, and » is the index of refraction, 
respectively); furthermore, a natural frame of reference is distinguished by 
the presence of dielectricum, namely that in which the dielectricum is at rest. 
This circumstance is indicated by the special formalism of the theory formu- 
lated phenomenologically by H. Minkowski fifty years ago [18, 19]. 

Minkowski’s relativistic theory of the electromagnetic field in uniformly 
moving dielectrica is still an actual problem of field theory having some open 
questions such as, e.g. the problem connected to the energy-momentum tensor 
of the field [1, 7, 8, 12, 15-17, 21, 23, 24, 31], as well as the theory in the 
case of anisotropic dielectrica [13, 14]. The theory which will be put forward 
in the following has some interest not only from the point of view of the appli- 
cability of the theory, but renders possible to throw new light upon the old 
problem of the theory. 

The basic idea of our theory can be formulated as follows: The anisotropy 
of the propagation of light in anisotropic dielectrica, as well as in moving 
frame of references relatively to the dielectricum too, is a real physical fact, 
which cannot be left out of account. If we have, however, the advantage in 
mind which is assured by the elegant formalism of relativistic electrodynamics 
in vacuum by the constancy of the velocity of light in all uniformly moving fra- 
mes of reference, it seems advisable to endeavour to preserve in the case of dielec- 
trica the constancy of the velocity of light in every direction too, at least in the 
frame of reference in which the dielectricum is at rest. So it can namely be hoped 
that the internal structure of the field will become more evident. This attempt, 
of course, can only be realized, if the measuring of length is based on the use 
of light signals [29]. This means that instead of the original pseudo-Euclidian 
metric a so-called light-metric is introduced which depends directly on the 
refractional feature of the dielectricum. 


n = "tal % Li Pri VSS = tal el À es a. dei 
- » 
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Naturally, at this point the problem immediately arises whether the light- 
metric of space has a direct physical meaning—that is, it expresses the real 
geometrical structure of the dielectricum—, or is it only a pure technique of 
the geometrical method? This, however is a problem for philosphy which will 
not be discussed here in detail. Nevertheless, it may be mentioned that also 
the first version does not lead to contradictions and seems to be supported 
by an optical argument dealt with in the following. 

It is obvious that this method can also be interpreted—particularly, if the 
above first point of view is accepted—as an intermediate way between the 
two formulations of field theory of first and second kind, respectively. The 
space, the geometrical structure of which is determined by the light-metric, 
can be regarded as the geometrical model of the dielectricum derived by geo- 
metrization of the physical (i.e., that of the refractional) feature of the di- 
electricum in Riemannian sense. However, this does not mean that this theory 
is a field theory of first kind, since the physical field itself, which is excited 
in dielectricum, is not yet geometrized, but is described by different field quan- 
tities which are defined in the so-called model space introduced above. This 
model space is from the geometrical point of view in the case of inhomogeneous 
and anisotropic dielectrica a generalized one, in so far as its metrical ground 
tensor depends on the point co-ordinates, and in addition on the direction 
co-ordinates, respectively 

A further problem is, of course, whether such a distinguished partial geo- 
metrization can be carried out and how far the validity of the theory can be 
expanded. The difficulties involved thereby are that the index of refraction 
(respectively the dielectric constant) depends in some cases on the electric field 
strength. However, if dispersional phenomena are not taken into account and 
only weak fields are considered as in the case of the validity of classical electro- 
dynamics it is usual—which means that the mentioned constants of the di- 
electricum can be introduced—our above supposition does not lead to any 
contradiction. Therefore, in the following the supposition will be put forward 
that the index of refraction (respectively the dielectric constant) of the dielectricum 
does not depend either on the electrical field strength, or on time, but is a function 
of the space co-ordinates only. 


2. — Geometrical characterization of the model space. 


In order to carry out our programme analytically initially it is assumed 
that in the four-dimensional pseudo-Euclidian space-time world a frame of 
reference is given in which a dielectricum is at rest having the index of re- 
fraction n. The geometrical structure of the space-time world is defined by 
the metrical ground tensor y,, the components of which are constant and 
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e.g. have following numerical values: 


(2.1) Yoo = — Vu = — Var = — Y33 = 1, Vuv = 0 (u SELV). 


The co-ordinates of the special frame of reference introduced above are 
denoted by é“ (*). If the dielectricum is an inhomogeneous one, the velocity 
of light depends on the space co-ordinates of the dielectricum, i.e. c— 
= c(£t, &, &°). Consequently, the time co-ordinate is & = ¢(&1, €, £*)-t, t being 
the time of reference. 

In the case of inhomogeneous and anisotropic dielectrica, however, the 
dielectric constant, as well as all quantities of the field, depend on the one 
hand on the position co-ordinates of space and on the other hand in every 
space-time point on the directions too. 

In order to use the common terms of modern geometry the concepts line- 
element space, as well as the fundamental element of the line-element space ought 
to be introduced. 

The fundamental element of the line-element space is a line-element defined 
by its four-dimensional space co-ordinates x" and by a contravariant vector v" 
the direction of which corresponds to that of our line-element. Since only a di- 
rection is defined by the vector v", it is evident that the components of v” 
are not independent fulfilling the relation 


n à 
vv, = const, 


only their proportion having a meaning. The ensemble of the fundamental elements 
(x, v) is the so-called line-element space (+). 

Nevertheless, let us assume that, as above in the case of homogeneous and 
isotropic dielectrica, for inhomogeneous and anisotropic dielectrica too the 
velocity of light depends on the spatial co-ordinates, as well as on the spatial 
directions of the four-dimensional space-time world of the line-elements, 
Meet, 2. ye), 

Finally, in the case of homogeneous and anisotropic dielectrica (like e.g. in 
crystal optics) the velocity of light depends only on the spatial directions, 
that is c= c(v1, 02, 0). It is clear, that from the geometrical point of view 
the adequate space is a special kind of the line-element space introduced above. 

Naturally, in the course of the above considerations the dependence of 
light on the co-ordinates x", as well as on the directions v“, respectively, re- 
presenting the inhomogeneity, and the anisotropy of the dielectrica, can be sub- 
stituted by the dependence of the dielectric constant and of the magnetic 


(*) The convention that greek indices run over the values 0, 1, 2, 3, and the italic 
ones over the values 1, 2, 3 will be used. 
(+) æ and v are abbreviations for x“ and v“, respectively. 
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permeability on the co-ordinates and on the directions too. 

For the sake of simplicity and to illustrate clearly the basic idea of the 
theory to be elaborated it should transiently be supposed that the magnetic 
permeability of the dielectricum is given by # =1 and the dielectric constant 
by «= e(&, &, &). Hence, based on the Maxwellian relation e — G/VE, where 
eo is the velocity of light in vacuo. 

Hence, the time co-ordinate & = (co/ve)t is not independent of the space 
co-ordinates é!, &, &, it is clear that on the one hand this co-ordinate system 
is not a natural one and on the other hand the question arises whether the 
space itself still can remain pseudo-Euclidian. Both difficulties can be solved 
by introducing a new system of co-ordinates with the transformation of co- 
ordinates: 


(2.2) o = Ve(Et, 8,8), =, (e>1), 


possessing the inverse transformation 


E = a[/Ve(81, E°, 8), gk = ak, 


Namely, in this manner an adequate system of co-ordinates is found where 
x — ct, as is familiar in the case of the usual co-ordinate system in vacuum. 

It should be mentioned that the transformation of co-ordinates (2.2) is, 
naturally, not a Lorentzian one, hence the determinant of the transformation 
does not fulfil the well known condition. 

To determine the components of the metrical ground tensor in the new 
co-ordinate system, that is, to give explicitely the metrical fundamental ten- 
sor of the model space mentioned in the previous paragraph, the well known 
formula of transformation of tensors 

de” OEP 
wr Gal da ab 


has to be used deriving for the covariant components: 


10, VE 1 
= Ap! 200 + VE 


1 aed 1 lo lo) 
Jim “<< vis OVE (70): "Yo —~ @ (yon Se VE =f Vio ve Som) + Yim 


al 
(2.3) Joo = 5 Yoo » Joi = 


e Qui dam dei da 


and for the contravariant components 


ze _ de 
DEA: 00 — 00 0 at 
; | oe 
(ple = gy, g"° —_ ve o tn pes ee È (év)y* N 
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respectively. Finally, 
def A, 
(2.5) godetlg,l= =. 


The structure of the model space is perfectly determined by the metrical 
fundamental tensor. The Riemannian curvature is e.g. given by 


R= te“{e + 1e-*(2°)?(0.e)(0°)} [(0 E)(0"e) ? — e ta0(de){Rax AE 
where 
e v e 0 
oe gl Oy 5 Oy a 
are and R,,— È,.,, is the Einstein-Ricci tensor of the space. 


In the course of the above considerations a method of construction for the 
model space starting from the pseudo-Euclidean space with the metrical funda- 
mental tensor (2.1) was elaborated. This method of construction is, however, 
a quite general one, in so far as that the formulae (2.3)-(2.5) also preserve their 
validity for a general metrical fundamental tensor y,, = y,,,(0), or y,,,= y,,(#, ©), 
respectively, where again vw are the co-ordinates of direction in the space- 
time point x“ In the case of a gravitational field e.g. y,, = y,,(7) and is de- 
termined by the distribution of matter in the space-time world. In the presen- 
ce of an electromagnetic field without gravitation the metrical fundamental tensor 
of the original space is determined by the energy-momentum tensor of the 
field taking Eïinstein’s well known equation of the general theory of relativity 
into account. In spite of this, as a first approximation, let us suppose that 
the metrical fundamental tensor of the starting space is defined by (2.1), hence 
it can be assumed that the influence of the electromagnetic field—being in 
accordance with our previous supposition a weak one—is negligible (*). 

Furthermore, in the following, at the first start, for the sake of additional 
simplicity it should be supposed that our dielectricum is a homogeneous one 
having a constant index of refraction and dielectric constant, respectively. 


(*) Without going into details it should be mentioned that—as second approx- 
imation—it may be assumed that 


= [V1 4+ 27/0, = uV1+2yle, 


x being the time-component of the dynamical gravitational potential satisfying the 
equation 
oo = 1+ 2z/e*, 


where go is determined by Einstein’s equations [21]. 
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In this case the equations (2.3) and (2.4) are reduced into 


(2.6) Joo = : > Gu=In=Js=—1, g,—=0 (u A?) 
and 

(2.7) =e, 0) gigi ag NN ge =o (U#7) 
respectively. 


It is obvious that the space having the metrical fundamental tensor (2.6) is 
also a pseudo-Euclidian one, however, the metric is defined by light signals. 
Hence, the velocity of light in a dielectricum with a constant dielectric con- 
stant is c= co/ve, the metrical fundamental form is given by 


(2.8) ds? = F*(x, de) g,,dafde° = > (x)? — [(da1)? + (dx?)? + (dx*)?] = 


= À at — (de) + (da)? + (de), 


as was expected. 

The transformation (2.2) of the co-ordinates is in this case a pure propor- 
tional transformation of the gauge along the axis of time. 

Now, it should be mentioned that the previous considerations are valid, 
so far, in the special co-ordinate system Æ in which the dielectricum is at rest. 
To calculate the metrical fundamental tensor in a new frame of reference K' 
moving uniformly with the velocity w = {tv}, m,,1v,} relative to K the proper 
Lorentz-transformation 


(2.9) dr“ — at da" ; ata, = 0", , det|a“ | = +1 
having the matrix of transformation 


x — xiv.fe — x0,/¢ — xw,/¢ 
—x,/¢ 1+(%—1)wi/w? (x —1)10,10,/10? ( x — 1), 10,/10? 
— xtv./e (x — 1)W,1v0, /10? 1+(x—1)m}/m? (x — 1)tv,t03/0? 
— 03/0 (x — 1)10,10, /m0? (x —1)w,10,/10? 1 - (x — 1)t0?/t0? 


ad“ — 
% = {1 — w/o}? 


should be used; finally, for the contravariant components of the metrical funda- 
mental tensor of the model space 


, e—tv?/e? 3 Tre #07 em a 
Dre 1 ife? GO nz gie I _ 1L—atv?/c*\ mm 
1 —w?2/ce? ww? ? 


is obtained. 
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It is obvious, of course, that the formulae (2.3)-(2.9) at the limit « +1 
—i.e. in vacuo—become reduced to the well known ones. 

The above method of construction renders possible, in principle, the ex- 
plicit determination of the metrical ground tensor of the model space, if the 
dependence of the dielectric constant on the spatial co-ordinates is known. 
Occasionally, the practical carrying out of this procedure is quite simple, on 
other occasions it is, however, complicated, inasmuch as the explicit dependence 
of the dielectric constant in cases of inhomogeneous and anisotropic dielectrica 
is not known. This problem will be discussed in the next paragraph too, never- 
theless, now another method directly applicable, at least, whenever the di- 
electricum is homogeneous and isotropic, will be given which is, however, 
also from the theoretical point of view of scme interest. 

By means of a covariant vector v”, introduced above for all directions of 
the space, a scalar fundamental function 


(2.10) (ar, v) e E (00)? — sure}, 


can be defined. It is easy to see that on the one hand, the measuring of the 
length can be characterized by %, hence 


ds == %(#, dx) 
and on the other hand, the fundamental function 3 has the following property 


N OR? 
(2.11) Iuv FA 9 Ov" Ov” ’ 


i.e. the metrical fundamental tensor is determined by the introduced funda- 
mental function. However, it should repeatedly be made perfectly clear that 
this last relation is only valid in the case of dielectrica having a dielectric con- 
stant independent from v. 

The relation (2.11) is characteristic of the Finslerian geometry [4, 9, 35], 
consequently the geometry of our model space is a special case of the Fins- 
lerian one. 

Often, also in the case of an inhomogeneous, as well as an anisotropic di- 
electricum it is theoretically possible to introduce such a fundamental function. 
Generally, however, an adequate geometrical basis for the field theory is, when 
such a fundamental funetion cannot be introduced, the general line-element 
geometry [10] being an immediate generalization of the Finslerian one. 
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3. — Indicatrix of space and Fresnel’s ellipsoid of the dielectricum. 


To characterize the structure of the model space at a point P(x), let us 
consider all directions crossing over P. The end point of the unit vectors di- 
rected towards the different directions starting trom P form a hyper-surface 
in the four-dimensional model space having the equation 


sa) Fa, 8) =1, 
where 


x 


is the vector of unit length in the direction vw“. This surface is the so-called 
Carathéodorian indicatrix of the space at the point P. 
Let the indicatrix (3.1) be cut by the hyper-plane 


(3.2) E=VI+e, 


then the equation of the surface of intersection is given by 
pet gs! 
(3.3) On §*E" antes. 


being just the Fresnel ellipsoid of the dielectricum. 

This is, otherwise, obvious from the method of construction and a direct 
consequence of the introduction of the light-metric in our model space. 

The correspondence between the indicatrix of the space and the Fresnel 
ellipsoid of the dielectricum in the case discussed above is quite trivial, but 
in the general case for the construction of the metrical fundamental tensor of the 
model space it is of good service to us. Namely, e.g. in the case of a homo- 
geneous, but anisotropic dielectricum the problem to determine the indicatrix 
in such a way that its cutting with a suitable hyper-plane should be 


3.4) en6 Et =1 


may be put forward, where e, is the known dielectric tensor of the dielectricum. 

In order to carry out this programme we should start from the assumption 
that the anisotropy of the dielectricum is characterized by the functional 
dependence of the dielectric constant on the spatial direction co-ordinates, 
i.e. €=e(v). According to the method of construction elaborated in the pre- 
vious paragraph for the components of the metrical fundamental tensor of the 
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model space the implicit formulae 


1 
We)? Jui = 92 = y = —1, Jw=9 (uv), 
can be obtained. With the help of this metrical fundamental tensor the fun- 
damental function 


fi STR) 
(ov) = yo )2 DE 

ai ) © ( ) ORTO, nf ’ 

may be introduced and the corresponding Carathéodorian indicatrix of the 
space is given by 


1 2 VER — 
a(v) ©”) — Ogee = F. 


By cutting this indicatrix by the hyper-plane 
= V1+ ev), 


the equation of the surface of intersection 


ni 
(3.5) Ob È = e(0) ’ 
is obtained. On the basis of the construction elaborated above in the isotropic 
case this surface must be the Fresnel ellipsoid of the dielectricum. Therefore, 
the equations (3.5) and (3.4) may be set side by side. In order to render pos- 
sible the determination of the explicit dependence of e on v, the ellipsoid (3.4) 
will be cut by the straight line 


sn 


where the direction of the straight line is determined by v* and 7 is a para- 
meter. Hence, by substitution 
(né Er = 1 


is obtained, and the co-ordinates of the point of the ellipsoid (3.4) are given by 
Er — vk life, vo}. 


Nevertheless, the same Fresnel ellipsoid is represented by the equations (3.4) 
and (3.5), therefore, 
: 1 Ono 


sg ? 


e(v) et wv 
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is obtained. Consequently the fundamental function of the model space is 
given explicitly by 


dx 0 VX 


d(0) Di ao nel. 
le,o"e cet 
In geometry usually such a space is called Minkowskian being a special case 
of the Finslerian space [34]. 
The metrical fundamental tensor of this Minkowskian space is determined 
by (2.11) and is given by 


ayiggk }2 

Joo = a pe Join di PURE se Col iO 

In this manner, the geometrization of the dielectric (respectively the refrac- 
tional) property of the dielectricum is carried out and we can proceed to the 
elaboration of the theory of the electromagnetic field in the model space intro- 
duced above. The theory, however, will only be discussed, in the homogeneous 
and isotropical case, the formulation of which is quite simple, well perspicuous 
and very suitable for direct comparison with the previous theories. 


4. — Theory of the electromagnetic field in dielectrica with dielectric constant 
e = const. and magnetic permeability u = 1. 


In a dielectricum with the dielectric constant «= const. and with the mag- 
netic permeability w — 1, è.e. in the case of a homogeneous and isotropic di- 
electricum with vanishing magnetic susceptibility, the metrical fundamental 
tensor of the model space is obtained by (2.6). 

The quantities of the electromagnetic field are derived from the four- 
potential ®,—{—, A} fulfilling the D’Alembert equations 


where 
€ 


Dit 98,0, = {4 + 0 + 0}= 20 4, 


0 © 
and s“— {o, î/c,}, denoting pg and A the scalar and vector potential respectively ; 
furthermore, o and à being the macroscopic charge and current densities. 


The potential P,, as usually, has to fulfil the Lorentz condition 


(4.2) 0, Dl D 
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Attention should be paid to the fact that the contravariant component of 
the current-density vector is defined directly by g and i—as it is usual— and, 
therefore, its covariant component occuring in the equation (4.1) is obtained by 


(4.3) Su = gass = {o/e, — 1/0}. 


In the classical theory of Minkowski the electromagnetic field in dielectrica 
is characterized by two antisymmetrical tensors F,, and G,,, respectively, 
having the components 


(4.4) C= {Fos Foo; Fy} 3 D —={F, Fa, Fe} = ba) 
and 


ples {Gro, Goo, Go} ; j= {Gos Gas Gyo} ’ 


where €, D, $ and $ denote the electric field strength, electric displacement, 
magnetic field strength and magnetic induction, respectively. 

In the theory elaborated here, however, the electromagnetic field is des- 
cribed by a single field tensor derived from the four-potential by 


(4.5) FP, Dro Di 

having the covariant components (4.4). But its contravariant component is 
defined by 

(4.6) PH = Gp, 

that is, owing to (2.6) 

(4.7) DSL VESTO i on i EE 


are obtained. 

Already at this point the advantage of the new geometrical method may 
be seen. The intrinsic structure of the electromagnetic field is expressed far 
more elegantly and naturally, furthermore, the formulation of the theory be- 
comes more simple. 

Owing to definition (4.5) for the field tensor F,, the identity 


(4.8) 0,1, + CPE aim 0,7, = 0 
is obtained corresponding to the second group of Maxwell’s equations. 
In order to carry out the deductive formulation of the theory in the case 


of empty space (i.e. o —0, à—0) the Lagrangian 


(4.9) LEE, FF = 4 (89 — CD) 
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is introduced being formally, however, only formally, equivalent with the 
Lagrangian of the field in vacuo (*). This means that if the model space is 
introduced the field theory in dielectrica represents a direct generalization of 
the theory in vacuo. 

If s“ does not vanish, the Lagrangian of the field is given by 


(4.10) L= e E a ce Dia” 5 
where 


1 
Ps" = (0/e)p + — (A) 
0 


represents the energy of interaction between field and macroscopie charge 
and current densities, respectively. However, the charge density occurring in 
the Lagrangian is the density of the so-called free-charge 0/e, as was expected. 

As is well known the field equation can be deduced by variation of the 
integral of action 


(4.11) ves [visi Late, 


v 


Q 


varying the potentials ©, for the fixed region Q of integration, subject to the 
restriction that the variation of ®, and its first derivatives at the boundary 
of the demain of integration vanish. 

The integral of action given by the definition (4.11) represents the general 
case. If the space is, however, the special one with the metrical ground tensor 
(2.6) e — const, the volume integral of Z is again an integral invariant. 

By carrying out the variation in the usual way the field equation 


(4.12) 0, Pi — gl 


is obtained, representing the first group of Maxwell’s equations. 
Based on (4.12) 


(4.13) 2,54 = 0 
is cbtained representing the so-called equation of continuity, that is the dif- 
ferential law of conservation of electric charge. 

Now, let us interrupt for a few moments the elaboration of the theory and 
return again to the problem of determination of the metrical fundametal tensor 


(*) Naturally, for the limit £ +1 the whole formulation of the theory is exactly 
equivalent with that in vacuum. 
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of the model space. Namely, on the basis of equation (4.6) the mentioned 
problem can be solved very strikingly. This new method has the advantage 
that it makes obvious that equation (4.6) is essentially a new covariant for- 
mulation of the constitutive equations determining the well known relations 
between the electric field strength and the electric displacement vector, further- 
more between the magnetic field strength and the magnetic induction vector, 
respectively, in four-dimensional language. 

Let u* be the four-vector of the velocity of the dielectricum having thus 
in the rest system of the dielectricum the components {c,, 0, 0, 0}. Then, in 
the Minkowskian theory the classical constitutive equations of Maxwell’s 
theory 


Dee, DE VB, 


al 
bb 
can be given in the form 


Gigi = Eat 
di 
By contraction of the second equation by wu 
il 
au Uy sa Ga ug + Gg = an uu, + B,,W ug + ci Fg}, 


and finally 


Gre oe ii J, popo at Eu sd sli [uP ue su uP y% u°]| noe gr 
n e ec 
Lal | po Be de st. Lu” pÉ ue + bye] Bosa 
n ä J 
is obtained. Taking into account that F,, is antisymmetrical in its indices, 
the coefficient of La is reducible and 


1 eu —1 , o EU— 1 9 
(4.14) Get = (pe ut (of FE et) Fa 
0 


Co 
is obtained. Now, let the equations (4.6) and (4.14) be set side by side, then 
it becomes obvious that between the metrical fundamental tensor g°Ÿ of the mo- 
del space and the metrical fundamental tensor y* of the original pseudo-Euclid- 
ian space and the constants of the dielectricum e and p, respectively, the relation 
exists: 


bat et Ls A 
ap __ ab B 


0 


460 J. I. HORVÂTH 


Apparently, whenever x — 1, g*’ is equivalent with the metrical fundamental 
tensor introduced above. 

On the basis of the above considerations it seems possible that also in the 
case of homogeneous and anisotropic dielectrica the same method is suitable 
for the determination of the metrical fundamental tensor of the model space. 
However, this is unfortunately not the case, or, at least, the investigations in 
this direction have so far not produced the expected result. N evertheless, the 
method will be discussed in this case too, since it, as well as the detail-results 
are also interesting in themselves. 


The problem will be discussed in the case when from the electric point of 


view the dielectricum is anisotropic having the dielectric tensor ex, but, from 
the magnetic point of view homogeneous, that is 4 = const. Then, the consti- 
tutive equations of Maxwell’s theory 


: 1 
Di= TE, S =-8, 
k=1 (2) 


can be given in four-dimensional language in the following form 


(4.16) (Gy 5 + ee Fue = ()’. 
and 


1 
(4.17) Gue Gof 4 Greet = {Pe fo + peut + PAPER E DOP 
IT) 
respectively, 4° being the above introduced four-velocity of the dielectricum and 
e0—1, 8% — 0 eit — or 


are supposed. By contraction with Ya" taking (4.17) and (4.16) into account 


Paces iat | 
CF u'u* = ey Ty? + cè G pa ; {feu a ae vue ur da GP YVES x 


is obtained, therefore 


(a? 1 x (03 1 (a! a x Le 
(4.18) gr — ayer ta LOUE — y*?)u® — (ele — yPeyy* jar ee, 
0 


is the expected relation between the two antisymmetrical tensors of the field. 
Now, the equations (4.18) and (4.14) can be set side by side, however, the 
coefficients of P,, seem to be irreducible against such a factorization carried 


out in the case of (4.14). Therefore in this way the introduction of a metrical 
fundamental tensor is hindered. 


da 
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So, in this case for the construction of the metrical fundamental tensor of 
the model space only the method elaborated in the previous paragraph is pos- 
sible. However, the close analogy with the classical theory is lost. 


5. — The electromagnetic field in an arbitrary homogeneous and _ isotropic 


dielectricum. 


Now, the considerations of the previous paragraph should be generalized 
and the meaning of the deduced formulae elaborated in three-dimensional 
language too. 

In the general case of isotropic dielectrica and paramagnetic dielectrica, 
that is, for e = const. and w = const., respectively, the contravariant component 
of the metrical fundamental tensor of the model space is given by (4.15): 


Pie ies) og argh gi 1i4/u ye gh = 0 (u 7»). 


It can be easily verified, that for the covariant component of the metrical funda- 
mental tensor. 


(5.1) oo = lfeVu , $11 = Jn = Jas = — VU; I wv = 0 (uv) 


and for the determinant 
g = detlgl=—£, 


is obtained. 

By carrying out the Lorentz transformation (2.9), the components of the 
metrical fundamental tensor of the model space in the frame of reference mo- 
ving with the velocity tv = {{0,, t0,, 3} uniformly relative to the dielectricum 
are given by 


A IG Os ee o di 
00 A Pa “| 
e mo EVU; I 1 mile 


MD, 
eVUT; 
Co 


= = Oi ty , to al 1/eu — 102/65 
JU cr a * lhe, 


0? leu 1 — 2/6 


In this manner the electric and magnetic anisotropy of the model space ori- 
ginating from the velocity of the frame of reference is determined. 
The covariant component of the field tensor is again defined by (4.5) and 


(4.4), respectively, and for its contravariant component owing to (4.6) 


ye {Fi Fo, Fo}, H = {pe, F31, Fr 
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is obtained. Equation (4.6) is equivalent with the constitutive equation (4.14) 
of the dielectricum. 
The Lagrangian of the field in empty space is given by 


(5.2) Lo “e QI ie == 3{DO La ED} 
and, if the four-current density does not vanish, by 
(5.3) L= {FE Det, 


respectively, which are only formally equivalent with (4.9) and (4.10). 
The D’Alembert equation of the potentials has the form 


(5.4) 00, = 95, + 92,2,0, = VE d0,- 1/FAD,=—a,. 
0 e 
But 
8,= 9,,8°= {olevi, — Vale}, 
therefore, in three-dimensional language the components of @, must fulfil 
the equations 


Lo N 
Ap dtp =— gle, AU — TE AU = — pifes. 
0 ‘0 


Similarly, field equation (4.12) represents the Maxwell equations 


ni 
div® = 0, curl $ — AD = i/c0, 
‘0 
as was expected. 


The D’Alembert equations for the potentials are compatible with the field 


equations (4.12) then and only then, if the potentials ®, fulfil the Lorentz 
condition (4.2): 


3 ni . VU 1 
0,D = 9g 0D, = vi CAR Pr VAS + CD, + 03D, } == 0 5 
being adequate in three-dimensional language with 
div A += ap =0. 
0 


The equation of continuity (4.13) preserves its formal validity too, however, 
in three-dimensional language it has the meaning 


0,0 +divi= 0. 
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The results of the two last paragraphs can be summarized as follows: 
the formalism of the electrodynamic field in vacuo preserves its formal validity 
in homogeneous and isotropic dielectrica too, if the new geometrical aspect is intro- 
duced. Naturally, it is true that at first sight, even if the complete formalism 
of the theory is more simple than the previous Minkowskian one, these results 
seem only to be formal, however, as will be shown in the next paragraph, it 
also has its advantages from the physical point of view. 


6. — Differential law of conservation. Energy-momentum tensor of the field. 
Four-force density. 


As is well known, based on the infinitesimal transformation of co-ordinates. 
(6.1) aM = at + 4 EM (x) 


(7 being an infinitesimal parameter and £(x) an arbitrary covariant vector 
which is a continuous and limited function of the co-ordinates x), some iden- 
tities can be deduced which from the physical point of view can be interpreted 
as the differencial laws of conservation of the field [25]. 

This method is very familiar in the topic of field theory and has previously 
been discussed in detail [7, 8], furthermore it will be dealt with too in the 
course of the elaboration of the general theory in Chapter IV; therefore, it 
will be elaborated here only in broad outlines. 

If the variation of the Lagrangian density brought about by the change 
of co-ordinates is investigated the explicit dependence of the Lagrangian on 
the covariant components of the metrical ground tensor g*’, on the four- 
potential ®, and its derivatives ®,, and on the current-density ./|g|s%, res- 
pectively, must be taken into account. Owing to (4.10) it may be presumed 
- that L has the explicit form: 


(6.2) L= V9) 49° 9" FF D8°} = Vig |L(9*"; ©; ®,, 8°). 


Assuming that the variation of g%Ÿ, ®, and s*, respectively, is carried out by 
arbitrary, as well as independent ways, the total variation of I—introduced 
in (5.3)—subjected to the restrictions that the &—gs of the infinitesimal 
transformation (6.1) at the boundary of the domain of integration 2 vanish, 
is given by 

ol = | d* d'or, 


Q 


where d*L is the so-called local variation of &: 


sete L'(x) — Aa), 
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that is, 
OL seta a Oli sts 
se* = Ld*(v g/l) + Via {ga do i +x 82+ 35, 30, + p 


and, finally, by partial integration 


3 —{1[ 0L es OL CI PRA OL “es 
on [Viol eu sat [sg HE — ooo d*D, +3 88" dix , 
Q 


is obtained. Based on the explicit form of the Lagrangian (6.2), we have 


ar = [vidi ae, Pt — GE) + Ha 8.871349? — 

à — (2, F* — s*]3*®, — D,5*s"}dr. 
Introducing the tensor 
(6.3) T,g=— (Pato GREAT 


and considering the field equation (4.12), as well as the relations (Sect. 13): 


S¥g*? = — nf(2,g°*)E* — (0,87) — (0,E8)g%} + O(n?) , 
d*s* = — 7{(0,8*) E (0 ,&")8 )s”} + O( n°) 


by partial integration and on the basis of the definition (4.5) of Pas 
ois nf igi {OT + Fs dix, 
Q 


is obtained. However, / is an invariant of the arbitrary changes of co-ordinates, 
therefore, it is stationary: 


d1=0. 


This is an identity for arbitrary &*-s and for arbitrary choice of the integration 
region, thus we have finally 


(6.4) — 0,T} =F,,8. 


It is well known that the ponderomotorie four-force density introduced by 
Lorentz in the electron theory is given by 


(6.5) kit Ps = È (ie), — Les += (i <2) À 
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In ponderable matter with e and w different from 1 it is not easy to find à 
unique expression for the force density acting on matter [21]. However, ac- 
cording to the above equation (6.4) it is obvious, that k, is the tensorial diver- 
gence of the tensor 


Tt: 


(6.6) i= CHAR ti ii 6 


Based on elementary calculations for the pure spatial components of this 
tensor 


T,. = E;D, + B,H, —+4 6,(ED + BH) 


is obtained, being in the rest system identical with Maxwell’s stress tensor 
in ponderable matter. Further, 


1 
{Tors TNT ota age eS = S, 
0 


where © is Poynting’s vector, and 


In the rest system © and h are identical with the usually recognized expres- 
sions for the electromagnetic energy flux and energy density in stationary 
matter. Finally, 


eee 1 Ts} = —(DxB)= CÀ 


The form of equation (6.4) suggests that the left-hand side of (6.4) is the 
ponderomotive four-force density of the electromagnetic field k, and that 7,7, 
as given by (6.6), represents the electromagnetic energy-momentum tensor. 
This would mean that the quantities ©, h, g in every system of co-ordinates 
should be interpreted as the electromagnetic energy flux, energy density and 
momentum density, respectively. The above expressions for ©, h and q are 
due to Minkowski; in vacuo they reduce to the corresponding expressions of 
the electron theory. This means, however, that the above equations (6.4) 
can be interpreted as the law of conservation of energy and momentum in 
homogeneous and isotropic ponderable matter. 

Minkowski’s electromagnetic energy-momentum tensor satisfies the same 
identity 


(6.7) = Ui, 
as the tensor of the electron theory, but it is obvious that 7’7+7'". This 
«non-symmetry » of Minkowski’s energy-momentum tensors has given rise to 


a long discussion in the literature and it was generally felt that this property 
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represented a real difficulty for Minkowski’s theory [21]. Abraham, there- 
fore, tried to construct a symmetrical expression for the electromagnetic energy- 
momentum tensor satisfying Planck’s general relation 


(6.8) g=—=S, 


in the rest system. 

This difficulty, however, does not occur in the theory elaborated here. 
It is, namely, meaningless to ask for the symmetry properties of the mixed 
component of the energy-momentum tensor, but its pure covariant, as well 
as the pure contravariant components, are obviously symmetrical 


a al PA N = ig La =9,,T 3 =T,, 


fulfilling Planck’s relation (6.8). 

Furthermore, Minkowski’s energy-momentum tensor satisfies Laue’s eri- 
terium of radiation [12]. This means, however, that the same energy-momentum 
tensor of the field fulfils both important physical criteria of the theory, if it 
is supposed that the electromagnetic field is induced in the introduced model 
space representing by its geometrical structure the dielectric and paramagnetic 
properties of the dielectricum. 

Finally, a further problem should be dealt with briefly. Let a point-like 
test body with the rest mass m, and electric charge e be given, d.e.: 


e=fotr) )os(c'—r)d*xz; (dr = da'da*dz*), 


6,(v) being the three-dimensional. Dirac’s 6. Hence, onwing to (6.5) 


1 + 10 : 
we les + = xxl, 


the equation of motion is given by 
dr ETRO : 
Mo gr = Tae = is (ixB)}, 


with s“— o(dx“/ds), as is usual in the electron theory. Now, the presence of 
the factor 1/vV4 on the right-hand side of this equation should be justified. 
On the basis of the definition of the metrical fundamental form 


1 
ds? = Aa“ da" = — 0 aS 
dada” = (dar) — /RI(et)? + (dat)? + (da*)?] — 


di en[(de'\® (de. fdx*\2 ci 
ae - odi | PESA 
Dre ci (eh es Ce) 


eva IT 
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is obtained, where drt denotes the invariant proper time interval. However, 
the usual relation between the infinitesimal distance and the proper time in- 
terval is given by 
Co 
ds = —— dr’. 
ven 

Therefore, it is obvious that in our case the gauge of the time unit is reduced 
by the proportion Vu. If we want to use the usual definition of the proper 
time interval, it must taken into account that 


dt’ — Vu dr . 


The reason of this abbreviation of the unit of the time intervals is due to the 
fact that in the case of dielectrica with 441, in order to derive the metrical 
fundamental tensor (5.1), the transformation of co-ordinates corresponding to 
(2.2) must be substituted by 


w= Venu a = (pe. 


Therefore, the correct equation of motion is given by 


D UMR | ie da | 
(6.9) ron à (È 8), 


as was expected. 


7. — Discussions. 


The programme of the above considerations was to elaborate the theory 
of the electromagnetic field according to the suggested new geometrical method 
in a quite heuristical way, in order to throw a strong light upon the general 
idea proposed. This elaboration of the concepts step by step was somewhat 
lengthy, however, it was necessary to make clear the physical and geometrical 
background of the method. Now, in this paragraph the invariant-theoretical 
basis of the elaborated theory will be briefly discussed too. 

Let us consider the conformal transformation of the metric of the space 


I! 
Ip > Iap = CG ap» 
where o =o(#) is a scalar. Taking the infinitesimal conformal transformation 


dg da: gb — g°P = gv? ox 
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and owing to (5.3), (6.3) and (6.7), for the variation of the Lagrangian 


N ax x} > od RIN ap Soa 
32 = gaa ld =5 Vi gl Tap9 dx = 0, 


is obtained. This means that the Lagrangian (5.3) is conformal-invariant [32]. 

Bearing this property of the electromagnetic field in mind, it becomes 
obvious that by the change of the metric of the space, where the field is in- 
duced, i.e. by the introduction of the suggested model space, the field equations 
and the whole formalism of the theory of the electromagnetic field will not 
be varied. This is the direct reason why the formalism of the theory of the 
field preserves its elegance being so suggestive in vacuum. 

Finally, two conclusions should be drawn concerning the above conside- 
rations. Owing to the introduction of the model space, it becomes obvious 
that for the case of the dielectricum the exclusive natural generalization of 
the energy-momentum tensor of the electromagnetic field in vacuum is the 
Minkowskian one defined by equation (6.6). Furthermore, it follows from (6.7) 
that the important conclusion of the theory in vacuum, namely: the vanishing 
of the rest mass of photons, preserves its validity in dielectrica too, on sup- 
posing that the electric and magnetic property of the dielectrica is represented 
by the geometrical structure of the model space. This is a very remarkable 
conclusion of the theory being in contradiction with the opinion and results 
of previous authors [22]. It is, however, needless to say that this is an imme- 
diate consequence of the introduction of the light-metric in the model space. 

Further conclusions of the elaborated theory, as well as the application of 
the method in the case of anisotropic dielectrica discussed above only in broad 
outlines, will be treated elsewhere. 


II. - A model examp'e of classical bilocal theory of physical fields. 


8. — The basic idea of bilocal theory of fields. 


The concept of bilocality of physical fields was introduced by Yukawa [38]. 
Since then, several authors have dealt in detail with this very interesting theory, 
particularly, from the point of view of the theory of elementary particles [27 , 28]. 

The general idea of the physical field theory suggested originally by Faraday 
and Maxwell was that—instead of the concept of the point mechanies ac- 
cording to which the action of the forces is an action at distance — the inter- 
action between to separate particles is transmitted by the physical field. This 
means that the change of the state of the field in a point of the space-time 
world depends only on the changing of the state of the stress of the field in 
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the immediate neighbourhood of the considered space-time point. If we take 
only the action of the state of the stress of the field in the infinitesimal neigh- 
bourhood of the considered space-time point into account some field equa- 
tions can be deduced by the well known limiting process for the characteri- 
zation of the balance, which are owing to the above considerations partial 
differential equations. The theory of the field is in this case an ordinary local 
field theory. 

If the mentioned limiting process is, however, neglected it may be assumed 
that the state of the field in the considered space-time point is directly in- 
fluenced by the phenomena taking place simultaneously in the space-time 
point of a sphere with a radius À surrounding the considered point. Apparently 
this assumption represents the general idea of the non-local theory of physical 
fields. It is clear that according to this assumption the idea of the action at 
a distance recalled in the inside of the sphere with radius À, however, from the 
field theoretical point of view this assumption does not represent any contra- 
diction, because the aspect of the field theory remains macroscopically—between 
elementary particles—unchanged. 

In Yukawa’s bilocal theory the quantities of the field in their matrix re- 
presentation depend on a point pair of the four-dimensional space-time con- 
tinuum. In the corresponding classical theory of field the different quantities of 
the field have to be functions of these point pairs. Consequently the natural 
geometrical basis for the bilocal field theory is the space of point pairs (#, &). 
The ensemble of the point pairs of the world continuum should be called space B, 
the foundamental element of which is (x, x). 

The ‘point pair (7, g) is space-like, light-like and time-like, respectively, 
according to 


a)? — (4° — x)? — (x — à) 30. 


x? 202 (ml mp : 
Li #) (%, (2) (d) (2) (1) (2) 


D © 
Now, let us introduce the so-called co-ordinates of Yukawa by the defi- 

nition 

yl — 1 (gt ol re ql__ gl 

x SR da sh D - 


Instead of the point pairs (%, g) we can use the co-ordinates (+, r). Let the 
ensemble of the co-ordinates (x, r) be regarded as the basic element of the Y-space. 
Since in Yukawa’s theory, owing to physical arguments, for the vector 7: 


a condition of normalization is assumed by 
(8.1) rir =A, 


where À is a constant, the four components of r” are not independent, conse- 
quently, only one direction is determined by the second group of the co-ordi- 
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nates of YUKAWA. This means, however, that the basic element of the space Ÿ) 
can be regarded as a ground element of a line-element space, the concept of which 
was introduced in Sect. 2. 

The condition of normalization which is introduced in Yukawa’s theory 
as an auxiliary condition can be substituted by an adequate one, as follows: 
Let us introduce instead of Yukawa’s co-ordinates r“ the co-ordinates 


(8.2) vo = er", 


where o is an arbitrary positive factor and it should be supposed that the quan- 
tities of the physical field are homogeneous functions of the variable vw“ of zero 
degree. This means, however, that the quantities of the field depend on the 
basic element (x, v) of the line-element space 2. Since only a direction is de- 
fined by the vector v“ it is evident that the v“s can be regarded as homo- 
geneous directional co-ordinates, their proportion only having a meaning. So, 
in (8.2) the introduced factor o has not any role in the following and the new 
denotation of the second group of Yukawa’s co-ordinates was carried out in 
order to recall emphatically that the co-ordinates 7“ must fulfil the condition 


yh aye” ls 
vv, = const. , 


because of which only three of them are independent. 
However, owing to Euler’s theorem the homogeneity of an arbitrary fune- 
tion of v“ of zero degree means that 


(8.3) DEF = 0-5 
where the notation 


def of 
(8.4) fw Di, Ov"? 


is introduced. (8.3) has the geometrical meaning that the derivative of a scalar 
of the field according to v is a covariant vector being orthogonal to the di- 
rection of the basic line-element determined by è”. 

Otherwise, the condition (8.1) is very characteristic for Yukawa’s bilocal 
theory of fields, inasmuch as, the bilocal theory is just distinguished from the 
non-local theory of fields by the condition (8.1). Namely, also in the case of 
non-local theory of fields Yukawa’s variables can be introduced and so the 
formulation of both theories seems to be quite adequate [20]. However, in 
the case of non-local fields the differences of the points & and # are not sub- 
jected to the restriction (8.1), this means that by r= gh — gp" not only a 
direction, but, directly the difference of a and x", respectively, is deter- 
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mined; è.e. in the case of bilocal fields the field quantities depend on seven 
independent variables (four position co-ordinates x" and three independent 
directional co-ordinates), however, in the case of non-local fields the quan- 
tities of the field are functions of eight independent co-ordinates (four-position 
co-ordinates x“ and four difference of co-ordinates r“). It is not doubtful that 
this circumstance is important from the point of view of the understanding 
of the basic concept of the bilocal theory not yet pointed out. 

In order to take explicitly in a covariant way into account that the quan- 
tities of the field depend on three independent directional co-ordinates let us 
introduce three vectors à (i—1,2,3) determining three arbitrary different 
directions. Then, the three angles 


4 1 ey A 
(65) Di = arccos {g,.e" BI (geo) sp END 


can be obtained (where g,,(#, 0) is the metrical fundamental tensor of the 
line-element space £) which can be regarded as inhomogeneous directional co-ordi- 
nates in all points of the space. The angles Ÿ, were introduced in an invariant 
way. This means, however, that the angles Ÿ, are invariants of the general 
group of co-ordinate transformations. 

It is obvious that by (8.5) a one to one correspondence between the pro- 
portion of r“-s and #,-s is determined. 

Finally, it should be taken into account that in the case of line-element 
spaces, the derivatives of a field quantity in respect to the co-ordinates x" 
depened on the homogeneous directional co-ordinates. In order to restrict the 
possibilities, it may be demanded, corresponding to (8.4), that the gradient 
of the field quantities has to be orthogonal to the direction of the line element, i.e. 


(8.6) wV,f—=0, 


where V, denotes the covariant derivative in the line-element space. If the 
covariance is only suggested against linear transformations of co-ordinates, 
the partial derivative in respect to the co-ordinates æ" is a covariant vector 
too and (8.6) can be replaced by 


(8.7) | wo, = 0. 


Condition (8.7) corresponds to the well known auxiliary conditions of Yukawa’s 
theory. | 


9. — Geometrical characterization of the space. 


The ensemble of the fundamental elements (x, v) was called in the previous 
paragraph line-element space. The geometrical structure of the line-element 
space can be defined by the metrical fundamental tensor g,, = g,,(#,v) which 
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can be determined arbitrarily and it should only be assumed that g,, are homo- 
geneous functions of the variables vo“ of zero degree [10]. 

The line-elements (a, v) will be changed by transformation of the co-ordi- 
nates as follows: 


oP = (2) Note da Dont A 2. +0, 
and the law of trarisformation of tensors is given by 
por ee à D Oat” rap 
LY Oa" OÙ. Citas a te 
The one-parametric sequence of the line-elements 
=), oF = p(t) (LISA) 


is defined as a curve of space L for the direction field v"— v"(t). The length of 
the are of this curve is determined by 


t* 
dæ da” | 
s -| Tante, v) at del dt. 
to 


Now, let the scalar fundamental function 
F(a, v) = {9,,,(%, 0) 0" vt 


be introduced beeing a homogeneous function of the variable v“ of first degree. 
With help of F(x, v) the vector of unit length 


(9.1) qu det yey pe 


can be defined in the direction of the line-element. Owing to (9.1) the inhomo- 
geneous directional co-ordinate of the line-element, introduced by (8.5), is 
given by 

(9.2) D; = are cos {g,,(æ, UA] (gap(@, DAY}. 


(i) (i) 


In our space the invariant differential of a vector È" is defined by 
DE" = d& + Of, du +I", da’, 


where OC, and I", are the so-called components of connection of the space, 
which can be calculated if I,» 18 known [10]. 
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The parallel displacements of vectors in the sense of Levi-Civita are de- 
fined by 
DE = 0% 


The covariant derivative of tensors is given by 


* B * + 
Eee II E aes i ee 


pie B B 
V, Le Es LUE > Line 

where the abbreviations 
del ta eae RN Ae = NEY 


ire 


are introduced and the index « 0» means contraction with the vector 1“, e.g.: 
À det iy GRE FD *o Ju 
dat lia UE IAA 

The equation of the extremal curves, or the geodetical lines of the space are: 


dx? da! dx” 
| T* 04 SI ( 
ds? * #” ds ds Ny 


being the Euler-Lagrange equation of the variational principle 


S2 


; da“ dx” |? 
| {onto v) ds A ds == 0 . 


Si 


The curvature of the space is given by the tensors 


Tig ph PAR, 
Vida mina Nin Ay. she: dol 
Bo = A ad Sur: 
where 
(9.3) RIINA A I wih 


is the tensor of principal curvature and T',,;,,, is generally an abbreviation 
introduced by SCHOUTEN 


def } 
The 3 here He R 


By contraction of the indices 4 and 7 of R,,, Hinstein-Ricei’s tensor 


(9.4) | VEST 


x*oup? 
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furthermore, by contraction with g*° the scalar curvature of the space 


(9.5) RE fk, 


can be derived. 

It was mentioned in Sect. 2, that a homogeneous and anisotropic space is 
characterized by the metrical fundamental tensor g,, —g,,(v). Now, let us sug- 
gest that the geometrical basis of the bilocal field is a homogeneous and anisotropic 
space, i.e. its metrical fundamental tensor depends only on the directions v* 
being a so-called Minkowskian space [34]. In this case there exists a frame of 
reference in which A= 0; that is, the sealar curvature of the space vanishes. 
Therefore, the space is a flat one. 

In classical physics the field is described by one or several (real) space-time 
functions y,—#y,(x) which satisfy certain partial differential equations, the 
so-called field equations. A usual alternative procedure is to start with a 
variational principle chosen in such a way that its Euler-Lagrange differential 
equations are identical with the field equations. This method renders pos- 
sible the so-called canonical formalism of the field. 

The canonical formalism of the theory starts with the definition of the 
Lagrangian © of the field. In the usual point space the Lagrangian is a scalar 
density and therefore the volume integral 


i =[ (are, 
Q 


—the so-called integral of action—is an invariant of the transformation of 
co-ordinates. 

The scalar and tensor densities, respectively, can also be defined in the space 
of line-elements by the usual law of transformation: 


pt BoB dr 
L= A-12: oe OO. _, 027 Oa" da B 
; sii Où dal Oa” "a 
respectively, and the Lagrangian 2 = £(#, v) can be introduced without any 
difficulties. However, the integral of action 


(9.6) I(v) =| 200, v)d'æ, 
a 


depending on v, has only a definite meaning in the usual sense if, in all points 
of the space a direction is given by vo = v"(«). In this case—as one SAYS — 
the integral of action (9.6) refers to the field direction vo” = v" (a), 
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In the following we shall not deal with the definition quite generally which 
is associated with unnecessary difficulties, but only with a special case which 
seems to be sufficient for our pourpose: 

Let à field of directions be v“—v"(x) satisfying the differential equation 


APR 
(9.7) saat Di 0) 


where 
al’ 


qa = On" (x, v(x)) . 


The condition of integrability of the differential equation (9.7) is 
(9.8) dd — 0,0, =— RE, —=0. 


On the basis of a theorem of Frobenius [33] the fulfilment of equation (9.8) 
means that a our space of line-elements there exists a parallel displacement of 
line-elements. This is a restricting condition for the space with the immediate 
geometrical meaning that for a given direction in an arbitrary space-time 
point in every other point of our space-time world a parallel direction in the 
sense of Levi-Civita can be determined uniquely. 

Therefore, in the following the integral of action (9.6) in respect to the 
field of direction v“ = v(x) fulfilling equation (9.7) will be defined. 

In the case of Minkowskian spaces the tensor of the principal curvature, 
fi’,, vanishes identically, therefore the condition of integrability of equation 
(9.7) is fulfilled and the integral of action (9.6) can be introduced without 
difficulties. 


10. — Generalized Schrôdinger-Gordon equation of the bilbcal fields. 


In the following the scalar bilocal field theory will be dealt with. In this 
case the field is characterized by a scalar bilocal function y= y(x, v). The 
type of this theory of fields — owing to the terms introduced above—is a field 
theory of second kind. Let the Minkowskian space with the metrical funda- 
mental tensor g,; =-g,,(v) be the geometrical background of the theory. The 
explicit dependence of the metrical fundamental tensor on the homogeneous 
directional co-ordinates will not be fixed. 

Let 


(10.1) o = v*(a) 


be a field of directions fulfilling our previous equations (9.7). Such a field of 
directions exists because the space is a Minkowskian one, where by the vanishing 
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of the principal curvature tensor of the space the condition of integrability 
of the differential equation (9.7) is automatically fulfilled. Then the integral 
of action of the field is given by 


I(v) =| 20, v) dix, 


where £ is the four-dimensional domain of integration and the Lagrangian 
density of the field is a known function of the contravariant component of 
the metrical fundamental tensor g*’, as well as that of the scalar field compo- 
nent y and its derivatives: 


L= V \9\- Dlg? (a, v), p(x, v), yp (a, v)] » 


Varying the function y for the fixed region of integration, subjected to the 
restriction that the variation of y and its first derivatives at the boundary 
of the domain of integration vanish, one obtains by partial integration in the 
familiar way [10]: 


ai bia ete) pm 
arf tease oy diz. 
{2 


Ya] 


Now, for an arbitrary variation of y satisfying the above mentioned conditions 
and for arbitrary choice of the integration region the classical field must be 
determined by the condition that the integral J should be stationary, i.e. 


Vu 0: 


From this follows in all space-time points: 
CET CET 

10.2 spurl QUES for 0° = 0% 

( ) da + Oy, 0, for v v (a). 
This differential equation is the field equation of the scalar field in its implicit 
form. In order to determine the explicit dependence of y on x", as well as 
on r” we should introduce instead of the homogeneous v“-s the inhomogeneous 
directional co-ordinates #,-s which, on the basis of the absolute parallelism of 


line-elements of space are uniquely determined in all Space-time points of 
the Minkowskian space: 


1(9,) = | Qa, 9) dee. 
Q 
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Owing to the invariance of 49;-8 against any transformation of point co-ordi- 
nates this integral of action is itself an invariant too. In order to determine 
the dependence of the field function y on #; too let us introduce the new in- 
tegral of action: 


(10.3) ies {10) eo = || (x, d)d'zd'v , 

and assume that for arbitrary variation of y, again subjected to the restriction 
that the variation of y and its derivatives according to +", as well as to J; 
at the boundary of the domain of integration vanish, the classical field is 
determined by the condition that the integral I* must be stationary, i.e. 


(10.4) O1 = 0% 


Let the implicite dependence of the Lagrangian on the field function y, as 
well as on its partial derivatives be determined by 


L= V|9|L(w, Par Vo) s 


then by partial integrations 


S*I ult Sy + SE, DES a pale d°9 = 


OL L OLI ae 
JF ae OO ht OS 


is obtained, where the notation 


| © 
(10.5) Pan = dop Ea » 


is introduced. Furthermore, owing to (10.4) and taking the arbitrariness of 
dy into account, the field equation 


GET GET 08 


0e 0, 
oy OY ei 


(10.6) 


(i) 


O 


is obtained. 
Finally, let us suppose that the Lagrangian of the field is quadratic in y 


and its derivatives, that is, e.9.: 


(10.7) L= 3vigl{d' "PP + Bey Pay + CPP, + D'yy,, + Ey}, 
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where the coefficients 4”, B*, C*, D' and E, respectively, are constants, or 
functions of 9,-8 fulfilling the symmetry relations: 


(10.8) AM ea A Be — BY. 

Consequently the field equations 

(10.9) {A°P 9, 0g + B"0,,0w — E}y(x, 9) = 0 

or 

(10.10) {A°P(9) 0,0, SE Bd) Ou) On) ae Br 0)0% = 3D' (9) + E(d)}y(«, 0) FR 0 


are obtained. 
Let us suppose that 


(10.11) (a, 0) = X(x) (9), 
then the field equations are separable and 


(10.12) AMX ,—wX=0;  B*0,,,+(u—E) —0, 


as well as 


AS(O)X y — 2X = 0, 
(10.13) 


B¥(P)6 5 + BA ste (u?+ 3D (9) "TE E(9))0 = 0 


are obtained. 


The constant of separation mw is generally quantized and can be brought 
into relation with the mass quantization. 
If 


(10.14) A° = g*®(9), Bk = git(9),  Di= const,  E = x? 


and the notations 


@ 


(10.15) “00, and ALI) 
are introduced the field equation 


(10.16) {0 qo ie 2} y(x, 0) = 0 


(8) 


is obtained being the generalized Schrôdinger-Gordon equation of the bilocal field: 
Let the original homogeneous directional co-ordinates v* again be intro- 


pn A AMT OL ad” 
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duced, then 


ov, 0 dd à 
lei) | gi 
È _ OÙ, 00” (o dd, so) 1 


and for the field equation explicitely 


(10.17) 


pplz, v) | d0° O0 | do dy(a, v 
op CPM?) È y(2, v) 
Ma ant 59,308" 3p, do | ads cae? 


is obtained which is the explicit form of the generalized Schrédinger-Gordon 
equation. By substituting the factorized form of 


p(x, 0) = X(#)V(0) 


into (10.17) we have instead of (10.13) 


(10.18) 


ox Cv à o OV 
ap _ 2 ge nni gie LE = = 
SE e dd 93, do (0 "5A, Di ur) Vie 05 


being the separated field equations of the bilocal field. 


11. — Discussions. 


It is well known that the second group of Yukawa’s variables v“, as well 
as the corresponding inhomogeneous co-ordinates of direction Ÿ,, respectively, 
are usually interpreted as the co-ordinates of internal motions of the field. 
This introduced internal degree of freedom is related to the non-localizability 
of the field itself. The operators 


F(v)#À + Hv), or F,,(0) 24 + ES) 
can be chosen in such a way that their spectrum should be positive and discrete 
the eigenvalues uw, (n=1, 2, 3, ...) being the masses of the different elementary 
particles the family of which is generated by the theory. The eigenfunctions 
0,(9) and V,(v), respectively, are usually termed internal eigenfunctions. The 
internal eigenfunctions play the role of the convergence factor of the non-local 
fields, however, as was pointed out by Yukawa there is an essential difference 
between the bilocal and the non-local theory of fields. Namely, in the case 
of the bilocal theory of fields all the particles w, which were derived from the 
eigenvalue problems (10.10) or (10.17) are taken simultaneously into account. 
Furthermore, the form functions for each of these particles are uniquely de- 
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termined by the same eigenvalue problem. However, further details of the 
theory of elementary particles will not be dealt with here. 

The equations (10.10) and (10.17), as well as (10.13) and (10.18), respectively, 
are completely equivalent. Often the use of (10.10) and (10.13) is more suitable. 
Otherwise, they are equivalent with the equations of Yukawa and in a parti- 
cular case they reduce to Rayski’s ones. 

Now, let us take the three-dimensional hyperplane 


(1141) x = const , 


into account, then in all points of the hyperplane the homogeneous directional 
co-ordinates vo” are distinguished by w—0. These directional co-ordinates 
are, however, subjected to the restriction v“v, = const, therefore, in this case 
only two independent inhomogeneous directional co-ordinates can be intro- 
duced. (This circumstance corresponds to the fact that in the three-dimensional 
space there are two independent polar angles). This means, however, that at 
contemporary points of the space-time world the internal structure of the 
bilocal field is determined by two internal co-ordinates leading to the rotator 
model of elementary particles of RAYSKI [26, 28]. 

The procedure mentioned is not independent of the frame of reference and 
is not covariant against Lorentz-transformations. However, if instead of the 
plane (11.1) a space-like hyperplane o is introduced the above considerations 
can be given in a covariant way too. 


III. — Differential structure of classical fields in generalized metrical spaces. 


12. — Classical theory of fields of second kind in general spaces. 


The above considerations may already illustrate the usefulness of the ge- 
neral concept of the new geometrical aspect suggested. The theory of electro- 
magnetic field in moving dielectrica, as well as the classical theory of scalar 
bilocal field elaborated in the previous chapters in broad outlines do not claim 
the right of being a final and complete theory, however, they justified the fact 
that the elaboration of the theory of physical fields of second kind in generalized 
metrical spaces is from the physical point of view of interest for future inves- 
tigations in the topic of theory of fields. 

By the last analysis, as is obvious owing to the above considerations, the 
original Riemannian idea is recalled according to which the geometrical structure 
of the space is determined by the physical properties-of the matter. As is 
well known, this Riemannian aspect was used by Einstein for a direct geo- 
metrization of the gravitational field in the case of the general theory of rela- 
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tivity. According to the new aspect, suggested here, the Riemannian idea and 
the original field-theoretical aspect should be reconciled by the assumption tha tthe 
physical properties of the medium where the physical field is excited may be taken 
into account by the geometrical structure of the space and the field is described 
by the usual methods of the field theories of second kind having as geometrical 
background the adequate metrical space introduced. 

In the case of the above first model example the metrical structure of the 
Space was directly determined by the material constants of the medium. 
However, in the case of the bilocal field theory the natural geometrical basis 
Seems directly to be a space with generalized metrical properties. 

In this chapter the general theory will be dealt with. Namely, the metrical 
geometrical space in which the field is excited should be regarded as a gene- 
ralized one, the structure of which is determined by an arbitrary metrical 
fundamental tensor. 

For the sake of generality let us suppose that the space considered is a ge- 
neralized line-element space with the metrical fundamental tensor Yu =I w( V)- 

By defining the Lagrangian of the field in the introduced general space 
too, the covariant field equations and conservation laws will be deduced. The 
tensor of energy and impulse will be also derived on the basis of the results 
mentioned above in a quite general form, for the different well known cases 
elaborated previously by HILBERT [6], Born [3], ROSENFELD [30] and DE 
Wert [5], respectively, it can be obtained directly by specialization. 


13. — Geometrical preliminaries. 


The geometrical preliminaries were discussed above, especially, in Sect. 9; 
further details can be found in previous paper [10]. In the present paragraph 
only some further concepts—important in the following—will be reviewed 

It is well known that in the immediate surroundings of a point of the 
Riemannian space a pseudo-Euclidian metric can be introduced; that is, to 
every point of the Riemannian space a « tangential pseudo-Euclidian space » 
can be given. 

If in the space of line-elements absolute parallelism of the line-elements exists 
we can costruct to every line-element of our space an osculating Riemannian 
space fulfilling the following conditions: 


a) The metrical fundamental tensor y,, of the osculating Riemannian 
space is identical with the metrical fundamental tensor of the original line-element 


space, that is 
Vu (@) Pe PACA v(2)) ? 


respectively, or owing to the homogenity of zero order of g,,(x,v) in the 
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variable v": 
(13.1) Vu) = Iny(@, lx) ; 


b) The geodetical lines of both spaces osculate each other; 


æ1t 


c) The invariant differential and the covariant derivative of the vectors È 


are identical in both spaces; 
d) The tensors of principal curvature of both spaces are the same. 


This construction of the osculating Riemannian space differs essentially from 
the Vargaian one [33] being far simpler and it is based on the existence of 
absolute parallelism in the space. 

To prove the correctness of our construction we have to calculate the para- 
‘meter of connection in the osculating Riemannian space: 


— cl 
Dep 9 { Op Vor + OY er a 0:V2p} = 


al lf 
“i 5 À Car ua 0,9 ee — Cap} + Das 


o C9 ar io Cas A 70 
5 | A dol 81? — | 


En ke 


Based on (9.7) and taking into account that—owing to the homogenity of 
(— 1)-th order in [? — 


CI 

a — POI » 
we obtain immediately 
(13.2) Te Lei Ua) 4 


Furthermore, 
QT, = 0,1, dl) 
and based on the definition of the tensor of principal curvature 


RP, = RP (a Ur), 


X*0T 


where Ris Riemann’s tensor of curvature of the osculating Riemannian 
space. 


However, in our case in the line-elements space absolute parallelism of the 
line-elements exists, hence 


* 
are B 
Le. = Rf," pe ? 
therefore 

FRE 

ER, pri. Ria 


().e.d. 


‘22 
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It is, of course, obvious that the construction of the osculating Riemannian 
space is independent of the particular choice of the co-ordinate system. 

The infinitesimal transformation of the co-ordinates is defined also in the 
space of line-elements by 


(13.3) at = a+ eE (x), 


where e is an infinitesimal parameter and £“(x) is an arbitrary covariant vector 
which is continuous and a limited function of the co-ordinates x. 
Let T(x,v) be a quantity of the space having an arbitrary law of trans- 


formation then we define its total, and local variation, respectively, as follows: 


or = T'(0,0)— T(x, »), 
Or = Ta, 0) — Ta, v) . 
Considering that owing to the law of transformation of 0” 
(13.4) Su" = e(0,E”)0 + O(e°), 
the connection between the two types of variation is given by 
(13.5) DT QUE EE (0, 1) = (TP O0 (Er. 


In the case of the local variation the operations 3* and è, and 0,,, res- 
pectively, can be exchanged, that is, 


DOS) = 0,(d*7) and oO) — veto Lc 
but in the case of the total variation 
(13.6) (0,7) = 0,(3T) — e{(0,T)E(0, E) + (0, T)(0,0,€)v"} + Ole?) 


and 
(13.7) C(O eh == dg (IT) (a (056°) AO (Et) x 
If the spatial law of transformation of T is given, itis possible—basing on 


our above results—to calculate the total and local variation explicitely; e.g. if 
T is a covariant tensor of second order 


ST” = ne e{(0 a) TI (0 ,€) Fe Æ Ole 
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and owing to (13.5) 
see — ef ( (3, Tyg age (à, Pò bee (0,€”) 7" a. (Oy 2"”)(0,€")v"} — O(e?) : 


In the case of tensor densities we have to calculate the variation of V |g|, 


where 
def 


g = det |g 


uv fs 
Since, as it is well known that 
5 Vig) ii VIgig, dg” = LV 19195, ’ 
we have 
Svigl=—evigl (8,8) + Ole*) , 
Now, for à tensor density 
SE = TSVIgi+Vigl 82, S*E = SX VIg|.T + VIglO*T, 


and e.g. 
(13.8) "9 = ev/|g| {(0,9)E" — (8,8")9”" — (GE)g + AU, (0,8")07} + Ole?) 


14. — Deduction of the field equations. 
Let us consider a field of directions 
(14.1) ote) 


fulfilling our previous equation (9.7), then the integral of action of the field 
is given by 


(14.2) I(v) = [20 v(æ)) d'æ , 
Q 


where © is the four-dimensional domain of integration and the Lagrangian 
density of the field is a known function of the metrical ground tensor g“”, as 
well as that of the components ®, of the field and their derivatives: 


L= Ly" (a, 0(0)); D, (x, v(w)); Dn (2, v(@)) ; B.1,,(a, v(æ))] 
with 
db 


punge RAT LR 


Die det V,®, = dat 
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and 
ag ef *0 
ag Du — FO BL sens 


being the so-called Berwald’s derivative of D. 

Varying the functions ®, for the fixed region © of integration subjected 
to the restrictions that the variations of the ®,-s and their first derivatives 
at the boundary of the domain of integration vanish, one obtains 


pag 
J US, 


RAS deu de NT 
Ana a Dur top dai Def die. 


STE 
0D, 01 #1 O@ da 


30 + | 


Based on the conditions of stationariness 
6I=0, 


we have by repeated partial integration 


08 ov au d uv ae 
ee — OT 30" 30, due = 0, 


5 
where 


CET CET 


GET ii * 
i ci : OD. OP, ya 2 


O e eri 
e O i 


However, this equation is fulfilled for arbitrary variations of the ®,-s which 
satisfy the above mentioned conditions and for an arbitrary choice of the 
integration region. Consequently for all space-time points 


0g oper, 
(14.4) TAR Dy 


=); 


is obtained. 
To put the field equations in their explicit covariant form we write 0" 


defined in (14.3) as 
uv uv ah RETT ua pee d ~~ uvÀ 
ow — A i A 1145 SU; ~~ dat a) ’ 


where the following abbreviations are introduced: 


pra del 08 ; Fei — 


OD, iy, Vi | 


Re a5; 
N s ay 4 DEN 


E 
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ae 


q 
ma Ok » | 
Ha pr dala NASA e SE ee Lai i 
2 n à) D, 3 aie il | 
AR eee a 
._ Since 
FA | 7 d v ea * x E va 
PR Vite Âge, + pokey, + FY di 
. È and owing to 
SES ses nr VATI 
nb — VÎ9g[= &Vigl- Ce VIIDET Sa = = 8, VIg|+ Ve al = 
È % da” i | 
È à nf ae 1 ,— cr 0g 
= = vita fon + Be or) vira, | 
ay . - i 
we have 
KL: DO È 
SI VIT VE = OTH + I, 
La 
x: _ therefore we obtain 
“4 x R 0 — gi» A Vig] vie ; i | 
pe: Furthermore, introducing the notation a 
af 4 en 
n OL 1 02 
SS D CC gs me Er 
SN °D, VTg1 à 1 
hee we can put our equation (14.4) in the form 
; aX “4 
di sm varie Lio — VP = 0, 
À e 
| and 
Be 
Nani a 
ii 8-0] 
\ respectively, where 
gr” get. x ae VITI] Vi pe ‘ 
Since 
A d uv ov yeu ue 
da SAT ATOS A | 
_ just as above, we have 
rh v d v “a 
V]g] V,4“ = ap À gere, : 3 
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and finally 
peo V,{F” NR V, Fe") i) È 


or explicitely 


vino, “Wai 6 


To (eae et | 1 : CL | Bee 

- VATI OD va, 
Finally, taking into account, that the determinant g of the metrical ground 
tensor g,, does not depend on ©, and its derivatives, we have 


(14.5) CO 


This equation gives for the field of direction vo“ = v“(x) the explicit covariant 
form of the field equation of our vectorial field. 

In the case when in our space the absolute parallelism of the line-elements 
does not exist basing on the Vargaian methods of construction of the osculating 
Riemannian space another version of this theory can be elaborated. How- 
ever, we shall not deal with this generalization [9] because the supposition 
of the existence of the absolute parallelism of the line-elements seems to be 
realizable in the practically interesting cases too. 

Finally, by introducing the inhomogeneous directional co-ordinates—as was 
performed in Sect. 10—the field equations can be put in another form, however, 
in the general case this will not be elaborated in the following. 

In Riemannian space equation (14.5) has the form 


OL + OL CASI 
pu 


14.6 en 
\ 0, 


where V, is the differential operator of the covariant derivative in the Rie- 
mannian space. 

In the pseudo-Euclidian space, where the components of the metrical ground 
tensor di, are constants, we have 


OL OL OL 
Le À x I ——_ = lo) lo] L : = 0 LI 
(14.7) dD Ò, 0D,, pane: eit 


Lu 


This equation is identical with the field equation of Podolsky’s generalized 
theory of electromagnetic field for an arbitrary Lagrangian L [26]. 
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15. — The differential laws of conservation. 


As is well known, basing on the infinitesimal transformation of co-ordinates 
some identities can be deduced which from the physical point of view can be 
interpreted as the differential laws of conservation of the field [25]. In the 
following also with general vectorial fields will be dealt with. 

If the variation of the Lagrangian 2 brought about by the change of co- 
ordinates is investigated we must take into account the explicit dependence 
of the Lagrangian on the contravariant components of the metrical fundamental 
tensor g” and their derivatives too. The derivatives of g’” can be found in 
the parameters of connections of the space and in their derivatives. But these 
derivatives are the partial derivatives of the g“ owing to which our Lagrangian 
density has the form 
(15.1) SER ds Gant dead Oo cia at Dur Dial 
with 

1e = 0,9” and Li pig: = Ong” 4 

If in our space of line-elements the absolute parallelism of line-elements 

exists a field of direction 


(15.2) yp" == #(0) 


satisfying our equation (9.7), the integral of action defined for this field of 
directions is 


I = [ste v(æ)) dix. 


Q 


Now, we pass over to the osculating Riemannian space introduced abo- 
ve in Sect. 13 which has the metrical fundamental tensor 
yin) = g(a, ve) ; 


therefore, basing on (9.7) 


dg!” 


Via = da + (0,.9°")(0,0°) = 0,9" — F(0,,9") Lite = da’ 
È pr 


and similarly 


ie oh d2g'” 
Vo T Qu dah * 


FA 
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Furthermore, introducing the notations 
m\ def È z def W7 x Cet È def 5 
P,.(1) So D, (æ, v(@)) ’ Pry = Ve D, 5 Pau = VaPus ; L(x) == 2e, v(x)) , 


the integral of action in the osculate Riemannian space is 
I =| etats È 


The total variation of J subjected to the restrictions that the €"-s of the 
infinitesimal transformation (13.3) at the boundary of the domain of integ- 
ration Q vanish is given by 


SI = [area i 


È 


where 
OL = ÔL == A) 
O* Lt = IQ], 0 y" = Nr Vy Vas RATE 
[ I, od tv vl CP, de E * OPy yal i 


being Z the Lagrangian in the osculating Riemannian space and 


oe if OL ne OL : 
oy” œ oy”, B OY wap) ’ 
the Lagrangian derivative of £. 


Assuming that the potentials ®, and the corresponding potentials g, in 
the osculating Riemannian space, respectively, fulfil the equation of field 


LE le = 


CIR iy Ole Si a OL A 
CP, i OP | es OPpiva 


the variation of our integral of action is reduced to 
dI 1202 d“z . 
2 
However, J is an invariant of the changes of co-ordinates, therefore 
sl = 0 
for the infinitesimal transformation of co-ordinates too. This means that 


(15.3) | [e], St dix = 0, 


2 
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for an arbitrary choice of the integration domain. But 3*y" is symmetrical 
in its indices u and v, therefore, the antisymmetrical part of the Lagrangian deri- 
vatives of L does not come into consideration. Taking into account as matters 
stand we shall introduce the symmetric tensor density 


(15.4) Z,,% —2 Sym ([£],) =—{[2], + [£1,}; 


we have 


[E d'a = 0, 


and finally —using our equation (13.8) for è*y""—this integral can be written 
in the form 


|| (i y") 25? (a sì dix = 0. 


gubjected to the restriction that €“ vanishes at the boundary of the domain 
of integration by partial integration it is obtained that 


Ms d d' = 
frere 
Q à 


However, this is an identity for arbitrary é“-s and for arbitrary choice of the 
integration region, therefore, basing on this consideration we have 


d ~~ n 1 eur d 
(15.5) da i: tre 9 T' da? Vu 0 ’ 
and 
K d x De o 
(15.6) dall A — LT; o — 0; 
respectively. 


This identities are deduced in the osculating Riemannian space. Now, 
returning from the osculating Riemannian space to our original space of 
line-elements we obtain, owing to (13.2) that 


(15.7) o, — Fd 


(0) 


DI va. ry x #4 = 0 


for the field of directions satisfying our equation (9.7). 
In Riemannian space, where Iuv = Je) equation (15.7) has the form 


(15.8) TT T0 


Li 


NEW GEOMETRICAL METHODS OF THE TEORY OF PHYSICAL FIELDS 491 


. E (0) . 
and in pseudo-Euclidian space, where g,, does not depend on the co-ordinates 
we have 


(15.9) aa MO 

Our equations (15.7)-(15.9) are the required identies which will determine 
the laws of conservation for the physical field. 
16. — The metrical tensor of energy and impulse. 

The metrical tensor of energy and impulse of the field was originally de- 
fined by D. HILBERT [6] in the Riemannian space as the coefficients of the 


6*g,,-8 in the integral 


(16.1) SI = 2/18" dG de, 


or explicitly 


(16.2) ER | 
| VIg] 
Now, owing to the identity 
UP FACE ot? 
(16.1) becomes 
(16.3) sl = —2 | [Q],yd*g"”’ dix , 
à 


and similarly the covariant components of the tensor of energy and impulse 
can be defined as 


VORO, 
(16.4) ae Vi È do 
It can be proved easily that 7” and 7,,, respectively, are the contravariant, 
and the covariant components respectively, of the same tensor. 

These considerations were valid in the Riemannian space. To define the 
metrical tensor of energy and impulse in the space of line-elements too—as- 
suming that in the space of line-elements the absolute parallelism of the line- 
elements exists we shall suppose that there is given a field of direction v“=v"(x) 
fulfilling the equations (9.7)—we introduce the metrical tensor of energy- 


s he 
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impulse (owing to (15.4) by definition) 


Pepin 08 __d 02 | 
(16.5) Ty, = DETTE leg" da“ |0g"”, ax? | 
Taking into account that 
LS =VIgl Er, 

we have 

aL dl i i OL SL 
(16.6) To dale ti — ta (og, 8° Og, dal | o 

Asa 
Lig aci ri) ee a ago. 1 


defined for the field of directions v = v“(#) where the differential operator 
d/dx* is the Berwald’s derivative introduced above. 
In Riemannian space the tensor of energy and impulse became 


ta = f OL OL = OL OL 
Fr — n ees e = 0 = = 
(16. { ) LE I uv L |eg"” x = T3: Mog B eg." na 
=, | OL —, OL OL 
œ°T Es ad Tr Og 3 — Op og”, |, 
and finally in the pseudo-Euclidian space 
(0) (0) \L OL OL 
(16.8) i = IYw [L jo" qu Tin J i 


Vago & agit, + = 80 gr fur D ? 
where L is the Lagrangian of the field in the corresponding Riemannian space. 

The metrical tensor of energy and impulse of the field (16.5) was defined 
on the basis of (15.4) passing from the osculating Riemannian space to the spa- 
ce of line-elements. Therefore, in the different spaces the tensors (16.6)-(16.8) 
satisfy the identities (15.8)-(15.10), respectively, representing also the required 
laws of conservation of energy and impulse. 

The above considerations were based on the assumption that in our space 
of line-elements the absolute parallelism of line-elements exists. The tensor 
of energy and momentum was defined in this case and it was shown that 
this tensor fulfils the laws of conservation. If the absolute parallelism of line- 
elements does not exist the tensor of energy and momentum (16.6) can be de- 
fined by the definition (16.6), however, this 7 n does not fulfil the identity (15.8) 
and, therefore, it seems not possible to give T,, some physical meaning. 


ok 


The author wishes to offer his cordial thanks to Dr. A. Méor for helpful 
discussions on the subject. 
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APPENDIX (*) 


Space-time structure and the violation of parity conservation. 


The violation of parity conservation predicted by LEE and YANG [39] 
— it has been illustrated by the cobalt experiment of Wu et al. [40] — 
was recently investigated from the point of view of relativistic invariance 
by WIGNER [41]. 

As is well-known, the consequence of this and other experiments may be 
summarized briefly saying that the symmetry of the real world is smaller than 
it had been thought. Namely, the whole experimental arrangement in the 
case of Co has at the beginning of the experiment a symmetry plane, which 
should remain valid throughout the further fate of the system. Nevertheless, 
the intensity of the / radiation is larger on one side of this plane than on 
the other one [42]. If it is, however, true that a symmetry plane remains a 
symmetry plane, the initial state of the cobalt experiment could not have 
contained a symmetry plane. Therefore, the radical solution of this paradoxical 
situation was suggested by WIGNER who stated that the polar and axial transfor- 
mation character of the electric and magnetic vector of the electromagnetic 
field has to be changed, with the immediate consequence that the charge 
density would become a pseudo-scalar rather than a scalar quantity as in the 
current theory. By this means the problem concerning the cobalt experiment 
would be solved, and the CP invariance predicted by LANDAU [43] would 
also be explained, however, this suggestion should entail further consequences 
that are at present difficult to forsee and it is inconceivable that such a radical 
change in the case of the theory of the electromagnetic field should not 
have been suggested by any previous experimental effect. 

Owing to our above considerations no less radical interpretation of the 
cobalt experiment and of other experiments which prove the violation of 
parity conservation, respectively, can be suggested in the expectation that 
it does not result any alarming consequence of the same kind. Let us namely 
suppose that by effects (especially in the case of weak interactions) violating the 
parity conservation the isotropy of the space part (or more generally that of a 
space-like surface) of the space-time world would be destroyed. 

Due to this supposition the anisotropy of the space would be determined, 
at first insight, by the experimental test quite phenomenologically. Let us 
suppose that the surface corresponding with the probability of /-decay per 
unit solid angle and per unit electron energy should be identical with the 
indicatrix of the anisotropic space. This means in terms of the suggested 
theory that the equation of this indicatrix in its parametric form is deter- 


mined explicitly by 


ai = S(E)1 { — a(E) cos D }sin Pcosy, a = S(H){1— a(E) cos J} sin 8 sin p, 
a? = S(E){1— a(E) cos 0} cos Ÿ , 


(*) Added in proof on 10th August 1958. 
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(where E is the electron energy and # is the angle between the electron mo- 
mentum and the spin direction of the decaying nucleus [42]), and the com- 
ponents of the metrical fundamental tensor of the space is than given by 


Gix(03, 02, 0°) = 0 S(E){1— a(B)o[(03)® + (v?)? + (v*)?]-#}. 


Such a space (*) is the special case of the general line-element space elabo- 
rated above. 

Obviously, because of only geometrical facts, it is clear that the reflexion 
symmetry in an anisotropic space is destroyed, however, in the case of CP trans- 
formation not only the sign of v*, but also that of a(#) is changed in the for- 
mula of the metrical fundamental tensor, which may be a natural geometrical 
interpretation of Landau’s theory. 

Since, it is well known that in the case of the longitudinal theory of neu- 
trino the asymmetry parameter a(£) has the order of magnitude w/e (where 
w is the mean velocity of the emitted electrons and e is that of light), 
this means that the anisotropy of the space tends to isotropy in the limiting 
case when the velocity (i.e. the kinetic energy) of the emitted electrons tends. 
to zero. 

The suggested theory based on the mentioned phenomenological deter- 
mination of the anisotropy of the space has, however, two alarming objection- 
able points: (i) the asymmetry parameter, if the suggested a priori anisotropy 
of the space is true at all, should be rather deduced from a law of nature 
than on the basis of the above phenomenological reasoning; (ii) it must be ex- 
plained that different asymmetry parameters follow from different decay 
processes. 

These objections have to be eliminated more accurately in the course of 
future investigations. Some months ago, when the belief was generally ac- 
cepted that the violation of parity conservation is only typical of weak inter- 
actions, it was possible to suppose that the anisotropy of the space at the 
position of the decay process depends on the number of the created neu- 
trinos (n) and for the asymmetry parameter the formula of interpolation 
a(E) = KE){n® +1 + 7(n —1)?\-1 could have been proposed. At present, 
however, when the violation of parity conservation also in the case of strong 
interactions seems to be proved, the supposition can only be suggested that 
the asymmetry parameter depends also on the internal structure of the inter- 
acting fields (in the sense of Sect. 9) and on the charges as well as on the masses 
of the interacting particles, respectively. But, unfortunately, the present: 
observational material does not render possible the determination of a — just. 
heuristical — formula for the asymmetry parameter and a start of further 
theoretical investigations. 

However, the suggested theory in terms of the unified theory of fields can 
be interpreted as follows: owing to Mach’s axiom that the laws of nature depend 
on the physical content of the wniverse, one can suggest that in the case of the 
weakest interaction (i.e. of gravitational interaction) the homogenity and in 
the case of weak interactions as well as in that of interactions, where also 


(*) This space has some interest also from the geometrical point of view. This 
is, namely, an explicit example for a space, all its three curvature tensors vanish, but 
ìt is non-Euelidian since its torsion tensor is different from zero. 
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the internal degree of freedom of the interacting fields became dominant, the 
isotropy of the space would be destroyed. Therefore, let us conclude that: 
the deviation of the structure of the space from its Euclidian character depends 
inversely proportional to the coupling of the interactions. By this means one 
can understand that the isotropy of the space (i.e. in Wigner’s terms: the 
law of symmetry) becomes dominant only in the case of electromagnetic and 
especially in that of mesonic interaction when the inhomogenity and anisotropy 
of the space is more and more overcompensated by the strength of the external 
interaction submitted by the physical fields of second kind. 

Finally, Wigner’s suggestion — and, as he has pointed out, the spirit of 
the ideas of Lee and Yang — that all symmetry properties would be only 
approximate, should be replaced by the geometrized supposition that isotropy 
and homogenity of the space, which was accepted a priori without any criticism 
in the current field theories, would be only an approximate one and by the decrease 
of the strength of interactions the anisotropy and inhomogenity of the space-time: 
world would become more and more dominant. 
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CoNTENTS. — I. The relation between P,- and Gauss quadrature solutions 
of the Milne problem. — Introduction. — 1. The P,-approximation. — 
2. Scattering involving higher atomic angular momenta. — 3. The cha- 
racteristic equation. — 4. The solution on the vacuum and the medium 


side. — 5. A short comparison with the Gauss quadrature method. — 
6. Completion of the solution. — 7. Investigation of f"(0, u). — 8. Density 
and current. — 9. Special cases. — APPENDIX I. Calculation of the sum 


over / in equation (15). - APPENDIX II. Calculation of the coefficients «,. 
— APPENDIX III. Summation of the C-W part f'(0, u). — APPENDIX IV. 
Calculation of the current at the boundary. — II. Behavior of the 
P,- solution of the Milne problem with anisotropic scattering for L + co. 
(Algebraic part). - 1. The error-source in the Boltzmann equation caused 
by spherical harmonics solutions. — 2. The functions, which tend for 
positive and negative w in the limit Loco to the exact directed flux. 


I.- The Relation between P,- and Gauss quadrature solutions 
of the Milne problem. 


Introduction. 


The fact that the Gauss quadrature (denoted here as the Chandrasekhar- 
Wick [C-W]) method and the spherical harmonies method of dealing with the 
monoenergetic transport equation are closely related, has been often pointed 


(*) Fulbright and Deutsche Forschungsgemeinschaft grantee assigned to Oak Ridge 
National Laboratory. Permanent address: Institute of Technology, Karlsruhe, Germany. 
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out in the literature. However, contrary to some statements which have been 
made, the two are not identical—basically the C-W method, as applied to 
the Milne problem, is a non-analytic approximation, whereas all functions 
used in the spherical harmonics method are continuous. It is the purpose of 
this paper to examine in detail the relation between the two methods. The 
Milne problem with p-wave scattering will be used as a convenient example 
for the analysis. Incidental to these considerations will be derived completely 
general spherical harmonics solutions to the Milne problem, and a comparison 
with the Gauss quadrature solutions will be given. In a following paper it 
will be shown how these approximate results, in the limit, give the rigorous 
results of the Wiener-Hopf method of solution of the Milne problem. 

Previous general solutions of the Milne problem by application of the sphe- 
rical harmonics method have been given by J. C. MARK (*), by B. CARLSON (?), 
and by M. C. Wane and E. Guts (*). The considerations which follow are 
more general than those of previous authors, first, in that the scattering is 
assumed anisotropic, and second, in that the relation with the Gauss quadrature 
method is shown more clearly. 


1. - The P,-approximation. 


Suppose a medium which occupies the region ¢<0 has s- and p-wave 
scattering of neutrons and has total macroscopic cross-section Y= Y,+ ,, 
where 2, and 2, are the cross-sections for absorption and scattering. The 
anisotropy of the scattering is covered by a transport cross-section 2. The 
x, y-plane is a plane boundary between the medium and the vacuum 2 > 0. 

Transport theory yields the integro-differential equation 


mei 


CHM) a 


U cE +E fl, pi ) = 5 2/tt wae +5 = ft ‘\u'du', 


D 


—1 
where y — cos Ÿ and Ÿ is the angle between the positive e-axis and the di- 
rection of the directed flux f(z, «). At 2= — co there is an infinite source of 
neutrons. It is convenient, to introduce dimensionless quantities È — X2 for 
the co-ordinate, ya=Z/2, yi=2'/X and later e=1—y,=2,/2, a=3y,(1 — y). 
The Boltzmann equation takes then the form 


Pi 


1 
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(*) J. C. MARK: The Spherical Harmonics Method, I (CRT- -340), II (CRT-338). 
(*) B. CARLSON: Neutron Diffusion-Spherical Harmonics Theory (MDDC-236). 
(5) M. C. Wana and E. GuTtH: Phys. Rev., 84, 1092 (1951). 
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Applying now the spherical harmonics method, by assuming first that 

(2) 1 1) = Ÿ ful Pan) 

one finds for the partial fluxes (47f4(£) = density, (42/3)f,.(¢) = current in 


e-direction, fs0(6), fao(6); …) the following infinite system of differential equa- 
tions (‘= 0/cC): 


1=0 bia Vador 
l=1 2 fo a foo Velie 0; 
1 LI 1 
l=2 7 Iso + a ar foo = 0, 
9 ' 7 
5 l=3 ape si 2 tes oe feo = 0, 
t+1, l j 
l DI, prolisso heres fi 10 = (is 


The general equation / holds for | > 2; the first two equations for /=0, 1, 
however, are changed or « disturbed » if y, < 1 and y,~1. One may describe 
the vacuum as far as its effect on the medium is concerned, as a perfect ab- 
sorber without scattering, è.e. y,=1, y:= 1 in the vacuum. In this case the 
general equation holds for all 1 >0, because f_,.(6)= 0; its solution in this 
case will be simpler than in the general case. 

In the L-approximation of the spherical harmonics method one cuts the 
infinite system of differential equations after the Lth-equation and demands 
that only the spherical harmonics up to and including P, shall be used in the 
series (2) as an approximation. This implies that f,,,0(0)=0, f,,,,(6) = 0, 
f,.30(0) = 0, .... The first of these equations, f,,,,(¢)= 0, turns out to be equi- 
valent to the characteristic equation, which determines the various modes of 
partial solutions of the finite system left after the truncation of the infinite 
system of differential equations. To put the remaining f, 4,66); frsa0(6); +++ =: 0 
is a process, which is, properly taken, inconsistent with the infinite system of 
differential equations, because these later harmonics are related to the non- 
vanishing f, (6) by a recursion formula which keeps them different from zero 
until 1+ oo. Indeed, putting f = 0 in the equ. {= L+1 of the system (3), 
one obtains 


L+1,0 


[(Z + 2)2L + 5)] fo + CE + TUE + Yi f = 0. 
f,:,,—0 does not satisfy this equation, because Pio is in P,-approximation 
certainly different from zero. 


Only after addition of a term [(Z+1)/(2L+1)]f; on the right hand side of this equat- 
ion it will be allowed to put f,,50 = 0. Then a glance at the further equations of the 
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system (3) shows the legality of putting f1+30=f1ia0—=f1450—=..—=0 also. Therefore 
the P,-approximation fulfils a Boltemann-equation (1) with an additional L-dependent 
source-term 

[(L + 1)/(2L + 1)](df,0/dé)P,,1(1) 


on its right hand side. This «error-source-term » leads to non-uniform convergence of the 
spherical harmonic approximations to the exact directed flux for L + oo, e.g. in the 
case of the Milne problem investigated further in part II, 1. 


The supposition 


(4) ho(6) = (21 + 1)g (A) exp [— ¢/A] (> 2) 
in the /-th equation of (3) yields a recursion formula for the g,’s 
(0 +1)91+1(4) + lgia(A) — (21 + 1)Agi(A) = 0, 


which is identical with the recursion formula of the Legendre functions of 
the 1° and 2" kind. Therefore, putting 


(9) fio(î) = (21 + 1)[A(A) P(A) + B(A)Q,(A)] exp [— C/4], 


with 2 arbitrary functions A(A) and B(4), independent of J, one is able to ful- 
fil the set of differential equations (3) for 2</<0c0. The parameter À is 
arbitrary until now and the solution f, above is a partial solution only. From | 
the Legendre function of second kind (*) 

1+4 


Qi(A) = P:(20,(2) — Wi1(4) with Qo(2) = 5 log 14) (121<1), 


one must use only the non-singular part W,_,(A), in order to avoid solutions 
which are singular at = +1 and to get an algebraic characteristic equation. 
W,_, satisfies the same recursion formula as P, with the index on W lowered 
by 1; this is evident since Q, and P, satisfy the recursion relation and Q,, 


being independent of 1, does not disturb the recursion. Hence, one adopts 
simply the pure polynomial 


(6) fig(S) = (21 + 1)[A(A) P(A) + B(A)Wi4(A)] exp [— é/4] 
as a partial solution for 1 > 2. The functions A(A) and B(A) are to be deter- 


mined by a calculation of f,(¢) and f4(£) as a consequence of the first two 
« disturbed » equations for 1=0 and 1=1. Supposing 


foo() = exp [— C/A] 


(4) E. JAHNKE and F. Empe: Tables of Functions, 4th Ed., Ch. VII. 
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one obtains from equations (3) for 


t=): fo(è) = 3yad exp [— ¢/A], 
li = is foo(C) = 5[— + =i 3 Vay 1A? | exp [— C/A] ? 
f= 2: fao(l) = 7[— (à + $ya)A + 3Yay 14°] exp [— C/A]. 


A comparison of these f,9(¢) and fso(6) with those of equation (6) yields 


(7) A(4) = di 3ya(1 Fa 7) == f= ah ’ 
B(4) ii A[1 Sr six 3Ya(1 trai v:)Æ] = À(c + aA?) . 


If we characterize the vacuum by y,=1 and y;= 1, B,,.(4) vanishes and we 
have only Legendre polynomials of 1st kind with the coefficients A_,.(4)= 1. 
Hence, the vacuum is described by the simplest solutions. The usual Milne 
problem has vacuum on one side; but one may see that the solutions above 
are adapted to two or even more media with arbitrary different ya, y#s. 

This method of connecting the solutions fw(î) with a supposed partial 
solution f,.(¢) was already used by B. CARLSON in the case of isotropic scat- 
tering. 


2. — Scattering involving higher atomic angular momenta. 


Let us interrupt briefly the course of further calculations to show that this method 
of determination of A(A) and B(A) may be extended to d, f, g,...-wave scattering. 
Then we shall have more «disturbed » equations in the system (3). We deal with 
d-wave scattering by putting a factor y, before f,, in equation {= 2, with f-wave scat- 
tering by putting a factor y; before fs) in equation J=3, and so on. 

If 7 is the first « undisturbed » equation with a factor 1 before f,, namely 1— 1: 


1+1 


eng gi bo TA lt 


then A(A) and B(A) have to be determined from the equations 


fio 
= = A(À) P;(4 Bia) W; , 
@ + Df (4) P;() + BG) W;_,(4) 
lu TIRA B A 
CI + fe (4) P741(4) + B(A) W;(A) . 


After the application of the relation 


P(A) W;(4) — Pr.1(4)Wr_(4A) =, 


502 W. KOFINK 


the solution of these equations acquires the simple form 


aot] 1 Men 

| 48) = fe) pg fn WA), 
(8 

be Lire EI : i | 

(4) = joo | + j fa Pin) — 77 3 fine Pr), - 


In a straightforward calculation one has to find fr/foo and fj. 0/foo from the « dis- 
turbed » system 1—0 until 1=7—1 and the two subsequent « undisturbed » equations 
1=1 and I=1+1. Putting the result in equations (8) one finds A(A) and B(A) in the 
extended case. 

In the special case 7=3 with y, as a factor before f,) in the equation /=2 of the 
system (3) one obtains 


Ae + ait iles oi 54]. 


Bale + ah? + o]1—af 2—e ef «|. 


where the additional terms with the factor 


(9) 


(10) b= 3(1 — ya) 


show the influence of the d-wave scattering. One recognizes that every further « dis- 
turbed » equation increases the degree in À of A(A) and B(A) by 2. The degree of the 
characteristic equation in P,-approximation, however, always remains L+1 in 4, be- 
cause it is independent of the constant factors Va» Vt> Ya». Consequently, some re- 
lations exist between the coefficients of all powers of 4? in A(A) and — B(A)/A with 
exception of the power 4°. If one writes in our special case 1=3 A(A) and B(1) as 
polynomials of 4? 


Bia 
AU) = Ait Al? + 4343 — . = B\+ Bye + BA, 
one obtains in this way the relations 
Bo = 45 B, = A, — 44, 


and the characteristic equation for the 1=3 case (the analog of the following equa- 
tion (12)) contains 


1 1 
Aya 24 pen (eff 50) = move 


as coefficient of its highest power 42+1, 


Presumably this coefficient will be simply the product of all y’s in the general case 
of arbitrary 1 < L— 1 also. 


3. — The characteristic equation. 


We return to the case 7=2, i.e. to our main course. The partial solu- 
tions foo(C), fio(î) and fio(¢) for 1 >2, when put in equation (2), give a partial 
solution f(î, 4; 4) of the infinite system (3) of differential equations, charac- 
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terized by an arbitrary parameter À. A general solution could be constructed 
by superposition of a dense distribution of such solutions 


fear | 2(2)}(£, es A)AA, 


where «(A) should be chosen so as to satisfy the boundary conditions. This 
process of satisfying the boundary conditions is reduced to an algebraic problem 
by cutting the infinite system (3) after à finite L in the spherical harmonics 
approximation. Then L-+1 equations remain, the last one being 


L +1 = 
OE 4 3 let oy ql 10t fo = 0. 


(11) 
In the P,-approximation, however, f,,,, is not allowed to appear; this is only 
the case if one selects the /’s of the partial solutions (5) in such a manner, that 
t..10(4) with these special /,’s vanishes. Therefore, the characteristic equa- 
tion in L-approximation is 


fr) = AAA) 4 B(4,)W,(4,) == il 
(12) 4.0. , 
(1 + aÆ)P,.,(4,) — A,(0+ a42)W,(4,)= 0. 


This is an algebraic equation of degree L+1, because the coefficients of the 
higher powers cancel. If L is odd, L+1 is even and the equation is of degree 
(L+1)/2 in 4; hence, one has (L+1)/2 pairs of roots 4, — 4. In case L 
is even, L+1 is odd, a root A= 0 appears. This event is uncomfortable and 
the following treatment is for odd L only. 


4. — The solutions on the vacuum and the medium side. 


Having obtained the f,,.(¢)’s, we put them in equation (2) and obtain the 
solution for f(z, u) 


a) On the vacuum side È has a different meaning in comparison with the 
medium side; it is ¢= £2 with an arbitrarily chosen cross section for ab- 
sorption £,= ££; £,=-0. After the calculation one may put Æ, —c but 
this is not essential. The solution on this side, 2 > 0, is in P,-approximation 


(13) f(e, 4) SD (Bs exp [— é/u,] xe P,(us)P(u) | 


(L+ 1)/2 


= (L +1)P;uu Ra EM ep [— Clea), 
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with arbitrary constants f;, which are found later from the boundary condi- 
tions at z=0. The characteristic equation is 


(14) P, (ui) = 0, 


from which we keep the positive roots uw; (j=1, 2,...,(Z+1)/2). One has to 
take positive roots only because f(z,) should tend to zero for 2 = + 00; 
hence the solution satisfies the boundary condition of the Milne problem at 
z—co. From the second expression for f(z, 4) in equation (13) one sees im- 
mediately that it has zeros in the directions u=—w; for all 2 > 0. Hence 
we fulfil the condition of no backward flux into the medium only in (L+1)/2 
discrete backward cones in the P,-approximation. This result is valid for 
z=0 also and is essential for establishing the boundary conditions on the 
medium side. f(£,u) has no poles in the forward directions «= + 4;, because 
the denominator u — 4; cancels against an equal factor in 


(Z+1)/2 


P, A(u) = (2L +2) UGC CESR ESS 8 ft — U;)(u +) - 
In the directions of the cones u= +; we have a finite flux for 2 > 0 


fe, + u;) = (L+ BP) P(u) exp [— C/#;)] = 


eee exp [— ê/u;]. 


b) The solution on the medium side, which must satisfy the boundary 
conditions at z=0 later, is in P,-approximation, <0, ¢ = Xz, 


(ZL—1)/2 


(15) f(z) = 2 [es exp LIL — 3yadatt + 


+ >» (21 + 1)((1 + a2) P(—A,) + A,( e+ a22)W,_.(—A,)) Pl} » 


with arbitrary constants «,. Instead of counting the roots froms 1, 2, ..., 
(L+1)/2 in the sum (15), the largest root 1, is distinguished from the others FO 
Azz Ag; oy À, 179 because it, being essentially the diffusion length, plays 
a somewhat different role in the Milne problem, which contains an infinite 
source at 2——co, than the other roots. To represent this fact, the sum 
contains one term with the coefficient x; and the root À =—A/,. The expo- 
ponential factor exp [C/;) = exp [— 6/41 in this case tends to infinity for 
¢—+—oco. It is the slowest decreasing solution with negative 7°s, if one 
follows the positive 2-direction from the source at 2 = — co, because A, is 
the largest root of (12). Therefore, (15) already satisfies the boundary con- 
dition of the Milne problem at 2—— co and the extra term k=0 is chosen 
to give the asymptotic behavior of f(£, u) at £ = — co. 


mn 
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By applying Christoffel’s sum formulae—see Appendix I—on the sum 
over J in equation (15) and by using the characteristic equation (12) one may 


give the equation (15) a simpler form (2 < 0) 


(Z—1)/2 2 
(6) flex) = Ÿ cad exp cla} Oh + Enel | 
= 0,0,1... k L+1\ 4k 


One recognizes immediately from equation (15) that f(z, 4) is a polynomial 
in u of at most the degree L in u. The equivalent expression (16) has no poles. 


in the directions y = —A,; the denominators Z,+4 are removable again by 
developing P, .,(u) —P,,,(—A,) in a power series of u + A,. 


In the following we always denote by >} a sum over k=0, 0,1,..., 
k 


(L—1)/2 and we take an index s for summing only over s= 1, 2, ..., (Z—1)/2 
after removing the 2 summands with k—0, 0. 
If we separate f(z, u)=f'(z, u) +f" (ze. u) into two parts 


c + ai 
Act f 


e+ ali P;h(u) 
An + U Pryx(t Aa) i 


(17) (4 = 2 Op Ay exp [C/Ax] — ah, + 


(18) fe, 4) = — > aA, exp [C/Ax] 


we have in f'(z, u) the result, which could be found by applying the Gauss 
quadrature method to the problem according to C-W for positive u. f'(z, 4) 


has poles, however, in the (L+1)/2 negative directions u —— A, and must 


be replaced in the C-W method by another expression using the original 
integral equation (see part II, 2). f'(+,u) is zero for u—u,; therefore the 
fluxes according to both methods coincide in the (L+1)/2 positive directions. 
=p; for alle<0. The addition of f"(z,u) removes the poles of {'(2, 4) 
at u — — À, again; the spherical harmonics method unlike the Gauss quadrature 
method yields in f(z, u) a continuous expression for all w’s but at the cost of 
a poorer approximation compared with the Gauss quadrature method. 

Considering now the boundary at 2— 0 we remark that f(0, u) from equa- 
tion (16) for z < 0 contains the (L+3)/2 unknown constants a, and from equa- 
tion (13) for 2 > 0 the (L+1)/2 unknown constants f;. Therefore one of the 
«,’8 is still available for the normalization of the asymptotic behavior of f(z, 4) 
at g=—oo. We impose on the first 2 coefficients the condition & + =1. 
This means that we choose the solution for the density 4zf,,(¢) in the fol- 
lowing manner (Ë < 0) 


(19) foo(S) = cosh i (tee — do) sinh > SE D x, EXP i) 
ho + 
— cosh à sinh è + Du exp i ; 
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with the linear extrapolation distance 0 = Ao/(%;— %). The distance of the ex- 
trapolated endpoint A is related to 6 by the equation 


(20) A= Flog a5. 


An alternate representation of f,.(¢) is therefore 


ink — C)/Ao 3 
en fo) = ES + Sa exp [6/2 (<0). 


6 and JA are dimensionless quantities which are to be multiplied by 27° in 
order to obtain the extrapolation distances in centimeters. The ba is the 


transient part, which does not contribute to the asymptotic behavior at 
z=—00, because the A, are smaller than A). 


5. — A short comparison with the Gauss quadrature method (5). 
i 


The characteristic equation in the spherical harmonics method was shown to be 
(1+adz) Prat) — Ax(c+aXî)W,(4;)= 0. W,(4:) may be represented (*) by 


+1 


1 | Pr+x(Ax) — Pr+1(0) 
du 


0) WA = 
(22) Wile) =5 eee 
St 
The integrand is a polynomial of degree L in uw; hence the integral is given correctly 
by the Gauss quadrature method, which is here used only as a mathematical aid to give 
another correct expression for W,(A,): 


(23) W Qu) (A 1 Vey Pray (Ay) — P,4+1(4;) 1 De Pr41(4x) 
2 LAK) = j ana ETA 
Daan dx — Mi 21 Aa 


— because P,;,(u;) = 0 —, where 
+1 
ACTES 1 frees 9 Qui) 9 W,(u;) 
i I = 4 D 
Put) — gu; Prat) Pris) 


—1 


Putting the last value of a; in equation (23) one obtains the formula 


4 A(L+1)/2 W . 
Wu) = Pour) Ae MEAN 
j= +1 (x — Ms) Py 145) 


(5) A similar comparison has been made by J. C. Mark, lc. (1). 
(°) A. ERDÉLY et al.: Higher Transcendental Functions, Vol. I, p. 154, Eq. (31). 
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which could be found by an application of the Lagrange interpolation formula also. 
With equation (23) the characteristic equation may be written in the form 


1 +G+0/2 a; 
L+ a3 — ile + al?) — 
2 5241 A — bj 


Py 14) = 0, 


where the factor P,;,(A,) is different from zero as long as ¢40, a0. Hence, the 
characteristic equation of the spherical harmonics method is also 


1 <Z+1)/2 a; 


(24) 1+ ai, — Ax(c + ay) > = 0. 
2 5551 À — Wi 
or, Summing over positive j’s only and using a_;=a,;, #_;j;= — 4; 
(L+1)/2 a; pais aa 


i A — n c++ ad)” 


The same equation will now be derived with the C-W-method. The original Boltzmann 
equation was 


41 sal 
Mia HG De - ms wa + 5 (1 = pouf pi) dp". 
: =a i 
Integrating this equation over x from — 1 to +1 one gets 
+1 +1 +1 +1 +1 +1 
= MAS, u)du+ | f(£, u)du= È (Ls yf pau] dy Lo craft) ay [nay 
at -1 a =i] -1 -1 
The last integral over w vanishes and one finds the relation 
+1 +1 
(25) 2 [ftt man = fie au, 
1 chi 


between the two integrals on the right side of the Boltzmann equation. This is an 
essential equation for solving the anisotropic case with the C-W-method. If one as- 
sumes now a partial solution with an arbitrary constant 4, 


f(o, u) — g(u) exp [— €/4,] 
one has 


of 1 a 
ac Se hel CA] 


and from equation (25) 


ei +1 


fuse. u) du = vf nu) du. 


A edi 
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Hence, the Boltzmann equation yields 


+t 


3 
ae — Ya) Ste run] [aan s(c+aku), 


€ 


(26) (—£ + 1Jaw = 
i -1 
with 
+1 


C = fon) du = const 


=f 
and one obtains the solution 


GA 
2 


Re (e + aà,u) 


(27) g(u) = 
as a u-dependent amplitude of the partial solution f(¢, x) above. The expression (27) 
for g(u) shows a pole at u=4,. In conformity with the physical conditions of the 
Milne problem one supposes therefore g(u)=0 for positive «x and equal to (27) for 
negative u at €¢=0. Equations (26) and (27) are consistent only for certain 4,’s, which 
one gets by carrying out the integral 


+1 0 


O [ A,(e + ai,u) 
0-fowan=5 | Far du 


i —1 


Hence, it follows a transcendental characteristic equation for the A,’s: 


144. 


T 


(27a) 1 + ak — 1,(e + aA?) log 


Selecting the 4,’s as roots of this equation, C in (27) may be considered as an arbitrary 
constant. For €40, however, the backward scattering has to be considered and the 
divergence of the integral would become manifest. 

Therefore, one replaces the integral equation (26) by a system of linear equations, 
evaluating the integral in L-approximation by Gauss quadrature method and using 
instead of the 4, of equation (27a) another À, 


+(Z+1)/2 


i 1 : 
(28) (-£ + ) (Hi) = rac + ak) S agi) (i=+1, +2,....+(Z+1)/2) 
1 


The y,’s are also in the Gauss quadrature method the roots of P,:,(u;)—0 and we have 
here L+1 linear equations for the determination of the g(w;) which depend on 4, by 
equation (27). Therefore, one obtains by introduction of g(u) from equation (27) in 
the equation (28) the characteristic equation of the C-W-method for those possible 
i, for which the system of linear equations (28) has a solution. This gives 


1 ++ A 1 £412 e+ az 
ar SRE Nene AA) ES of an + i = 
jot k j=t1 kr M5 
a ,++1/2 1 EH aq, 
=— Zh Z a+ Zale + ad) eo 
2 Mae 2, Ax — My 


je tl 
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+(Z+1)/2 
because > a;= 2, we get finally the characteristic equation in the C-W method 
j=+1 
5 l'iG+0/2 a; 
(29) 1+ a4, —A,(e + adi) — '_ = 0 
2 32h. Way 


which is identical with equation (24). Hence their roots 4, are identical also. 
In the C-W method (7) one has now to assume a superposition of partial solu- 
tions—for the negative z of our medium with the opposite sign of the /,’s— 


rg da A . 
(30) foal, #) = ¥ g-x(u) exp [6/4] = Y cal E exp [2/4] = 
k ke et M 


e + ai 
Ax + be 


9 


= yD Oy Ay exp [6/Ag] | ad, ay) 
È 


where the «,’s replace the constants 3C,’s of equation (27) for the set of — 4,°8; as 
before they must be determined from the boundary condition at z=0. One recognizes 
the identity of the C-W expression, (30) and the first part f'(2, 2) of the spherical 
harmonics method equation (17). 


6. — Completion of the solution. 


For the fluxes at the boundary 2=0 we have two expressions: equa- 
tion (16) on the medium side and equation (13) on the vacuum side 


je [ee 2105: SEZ | 
(81) e ee A ze | 


toe Plu; 
= (L +1)P,,;,(u) iS Bi, + 


They must be equal for every 4 between —1<u<1; it can be shown by 
a suitable application of the Lagrange interpolation formula to the medium 
side of this equation, that the fs can always be chosen in such a way to fit 
this condition. To determine the remaining (L+1)/2 «,’s (this means for 
instance a, — ax =,/ò, 3, G++,» %z-1j») it is sufficient to remark that the 
vacuum side expression vanishes at u=-—u;. Hence the medium side ex- 
pression has to vanish at the same points; this yields (Z-+1)/2 inhomogeneous 
linear equations for the remaining (L+1)/2 «,’s: 


(32) D ode | ay eae (i AES ib. 
k Ay — MJ 


(7) S. CHANDRASEKHAR: Radiative Transfer (Oxford, 1950). 


Le 
LAI a A7. 


See eu 


es 


am Ha page 
seh Pn o Ory E 


A, 


a 


lee ee EN ES € 


4 > sii PAL 
i ne er idole ne 
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Because P,,,(-;)=0 in equation (31) the C-W method and spherical har- 
monics method lead to the same equations (32) for the coefficients «,. The 
appearance of the us in both methods is a special feature of the Milne problem 
and other problems with a vacuum on one side. In these cases the replacement 
of the vacuum by a perfect absorber leads in P, approximation to the use 
of the pure Legendre polynomials of the 1* kind on the vacuum side, whereas 
they are introduced as a mathematical substitute in the C-W method on the 
medium side. 

The calculation of the coefficients «, is carried out in the Appendix II. 
In solving the system (32) of linear equations one is led to the introduction 
of «half Legendre polynomials » which have only the positive roots 4; of (14) 
and À, (s =1, 2,..., (L—1)/2) of (12), namely 


(L+ 1)/2 (Z—1)]2 
(33) pia) =TT(A—m) and = r(4) = J] (A—A,) 


excluding the largest root A, of (12) from r(4). Intimately connected with the 
extrapolation distances 6 and A is the following function of A 


r(4)p(— 4) + r(— 4)p(4) 


34 Ô(2) = À 
sa (A) = Gt a) = rp) 


(1) is a function of A? only. One may use this function for a shorter repre- 
sentation of the coefficients x. 
After introducing two further abbreviations 


(35) Tees te 
È 


a ie dee do) da) 
1(Ag)p(— Aq) + *(— Ao) p (Ao) O(Ay) + (a/e)A5 d(Aq) ’ 


the solution «, of equation (32) may be written 


(37) a = vii O) Pi 


Gian 1 —{ 2,2.) P(A) 


(39) «= N î ae 1,50%) e+ ad  2}p(4) iu 
e e+ ads A,(4° — A2)r'(4,) AE di 
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The extrapolation distances are—see equations (19) and (20)— 


A(Ay) + (a/e)A6 d( 
aC Toe a) a 


Cl, 
a = 3ya(l — y); 


(40) si Ph 


and [tgh-! stands for the inverse function of the hyperbolic tangent] 


Ò(A0) Da 
Fa oa tgh ri Ao 00) . 


A=, bg! 


Introducing the coefficients «, in equation (16) one obtains the solution for 
the directed flux f(z, uw) in every direction u= cos? and on every plane 
2g=-¢/' of the medium in P,-approximation for the Milne problem with aniso- 
tropic scattering and absorption. The case of fission, with production of 
neutrons having the same energy as the neutrons which induced the fission, 
is covered by the formalism by putting y,< 0. 

At the boundary 2=0 the 1° part of f(0, ~) namely f'(0, u), i.e. the C-W 
part for positive u—equation (17) with the coefficients (37)-(39)—can be summed 
up to a closed expression by a calculation carried out in the Appendix III. 


À p(—u) 
Ri ut r(- à) 


(41) PO, i) = 2N(0 + 88) [1+ TG 


This expression has poles at ~=—A,. Remembering that f(0,w) is a poly- 
nomial of degree L in u coinciding with f (0, u) at the points u= 4; and having 
zeros at the points u ——w; one is able to apply the Lagrange interpolation 
formula to construct this polynomial, which represents the directed flux at 
the boundary according to the spherical harmonies method and which is regular 
for all u (—1<u<1): 

(Z+1)/2 f'(0, u;)Pry (4) 


(42) NS ain de 


: o (L+v/2 1 + (a/c) Ü(A)u; 
= 2N (ce + a43)p(— u)p(u) > D | 


By comparison of equation (42) with the second expression of equation (13) 
for ¢=0 finally the coefficients 6, of the vacuum side representation are 
obtained 


io f'(0, #3) __ 2N26(0 + AA) + (a/0) (up 
D  (L+1)P,(w)Pry(u) (L+1)P,(w)Pyx:(4)(Mi— M Hs) 


S. CHANDRASEKHAR introduced in his book Radiative Transfer, p. 127, a fun- 
ction H(u), to which the functions p(u) and r(u) here are related by the 


MPN mera) MIA PRIA Coe >) ag LE + "” N Pa CU a, SEO SI, ae l'AS EST 


AU 
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equation 
H(u) = Ahi + Agfa — p(— n) 18 — 1 p(— u) a 
NS Hyg + Mine (Ap + 4)T(— 4) VA Re (a/3) (Ay + u)r(— y) 
Ay (0) p(— 1) ( a _ | 
= ; Remark: 1—€—3 = Va 
À +e pO) r(— y) Le GENGA 


"7. — Investigation of f"(0, w). 


The representation of f(0, 4) given in equation (42) has the form of equation (13) 
for z=0, or of the second on the right side of equation (31), both of which were derived 
from a consideration of the vacuum side. Therefore it was possible also to deduce 
from formula (42) the coefficients 6; (equation (43)) of the vacuum side representation 
of f(0,u). We obtain the medium side representation if we put the «,’s of equa- 
tions (37)-(39) in the first term on the right side of equation (31) or in equation (16) 
for z=0. Then we get the medium side representation, which is rather lengthy. 

The connection, between medium side and vacuum side representation is quickly 
‘obtained by an application of the Lagrange interpolation formula to the medium side 
representation of f"(0,). To this end we construct the function f(A)=1— BA with 
the use of its values at the L+2 points Z;=(—4o: Ag, 41: «> An: — Ha — Mas ve 
—,+v/2) at which the polynomial R(A) = (A2— 2j)r(4)p(—-4) has its zeros. Then the 


‘Lagrange interpolation formula 


(44) UNE era 
R(A) all Aj's (A rs À;) R'(4;) 
gives for A = — ym 
1 1 e: 
bi Bei si +. Bay i 1 — Bay n 
Qu? — 22) r(— 1) p(u) Udo — 2) 249 1(— Ag) P(40) | (Ay + 4) 24g 7( Ap) p(—Ap) 
=))!3 1 — BA, (L+1)/2 1 + Bu; Il 
+ —— i 
2 arnazarapenti À Qu — p)(28 — 13) r(— 1) p' (uy) f 


Now f"(0, x) in medium side representation (see equation (18)) 


:+a4 Pr) e+ adi p(u)p(—p) 
f2(0; 4) = Yan LI i 
Zad tu Pel ha) LS PAPÀ) 


with the coefficients «, from equations (37)-(39) 


i, 1 + (a/e)d(Aq) Ao 1 — (a/c) d(Aq)A 
(46) = — Nile + ai) nH) pt 10 | so 
VPI EI Yz api dp yo 
Wye 1 — (a/e) d(Aq) As 
+ 24 
È 8=1 (4, + n)(25 — 3)r'(2,) n 


is simply proportional to the first 3 terms on the right side of equation (45) with the 
constant f—(a/c)ô(1,). Hence by comparison with equation (45) we may write /"(0, 1) 


“al 
; 4 
7 
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of equation (46) also in the following from 
(47) f"(0, w) = N-2%5(c+ ado) p(u) p(— 1) 


[ L+ (ale) (Aa) as 1 + (a/c) 6(Aa) us I 
UR) 2) plu) 4 (a (8 — di ij 


2 Ao ar 
eal 5( 7 ge Ni - 
Ay — peri y) 
CRE 1 + (a/e) 6(Ag) #5 


+ 2N A5 (c a ax) u)p(— p) 
x (4 — w;)(46 — # 


) 
DC bs) (es) | 
The first term on the right side of the last equation is — f!(0, u) in the form of equa- 
tion (41) again. Hence we have extracted by the use of the Lagrange formula the 
« pole-part » — f'(0, w) of f"!(0, uw). The second term is the result of the spherical har- 
monics method f(0, x) in the vacuum side representation of equation (42) again. Keeping 
equation (46) and (47) together we have the relation between the two representations 
of f(0, u) on the medium side (M.S.) and on the vacuum side (V.S.) 


JL + (a/e)6(4)u p(— 1) 


A . 2 2 
OSE ESL SCS ra np 
, 1 + (a/c) d(Aq) Ay 1 — (a/c) S(4a)Zo 
+ PP |a pyr Aa) PU) + At AA) : 
(&—1)/2 1 — (a/c) d(Ag) Ag : 
M.S. ; 
st > vo ai r'(As) Dan de 1 


AI 1 + (a/e)ò 


49 = N°24 Ro) P 
(49) o(¢ + a En (u— 4) (4-1) = 14) p' (145) 


(V.S.). 


8. — Density and current. 


In our monoenergetic treatment of the problems we may assume that our velo- 
city is the unit-velocity. Then we do not discriminate formulas for flux and density etc. 
We may replace 2,(c+a) in equation (16) with the help of characteristic 
equation (12) È; 
(1 + @4%)P ,41(4x) 
Ax(c + ai) = 
si + x) W, (x) 


and obtain another sometimes convenient formula for f(2,u), & << 0, L odd: 


ili — ah. Pirla) i = Ph 4) 


f 2 
(50) fw) = Yan exp [EIA] | SZ + ag an + 4 ji 
Because 
1) Pix) — Prn(— n) 
W,(Ax) axe 1 À ae u du , 


= 


33 - Supplemento al Nuovo Cimento. 
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we obtain quickly from (50) the density (e < 0) 
+1 
(51) ot) = 2a | fC, u)du = 47 Dx exp [6/4]: 
È d'A ah I POSSE 
ali 
— atts [a+ WA) 3) a A an = 
“1 


J 
| 
= dx Das exp [6/4] = derfalt) =~ Fe, 0 


At the boundary 2=0 the density 0(0)=47 Y x; is related to the 1* part 
k 


of directed flux f'(0, 0) in the direction ~=0 which is according to equa- 
tion (17) and equation (41) 
(0 


(52) f'(0, 0) =e > a, = BN (CA a?) PO 


Hence the density at the boundary in the spherical harmonies method is 


p(0 
(53) o(0) = 4x > @, = S7N (1 + - SA) aot 
This equation relates also the density at 2— 0 with the normalization factor 
N and could be used for a change in normalization from N to unit density. 
The normalization with N results from the choice in equation (19). 
Also the current j(z) for 2 < 0 is quickly obtained with the help of equa- 
tion (50): 


(54) j(e) = 2a fut, 0) a ; 
À i Ae 1 n IH 1 
= dn Yes exp(tiad {alt 5 [udu + i fe Fen Ent #9 ay = 


1 À ni 2H) 14 
= dn Yon, exp[/ Ar] un) Al du [Pesds} Pros} E lA 4 as He 
= 47 2 Xn EXP Ch] y Wi oes A) Por (4) A W (Ax)]= 


1 
EL > cx exp [C/Ax] a À ae LE i a,.W(ax), with (12), 


=4n(c-1) da oA, exp [C/A,] = — Any, > cà: exp [C/A,] = = prea Re 
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For 2—0 we have the current at the boundary 
+1 
L Am 
(55) 3(0) = 2 uf, u)du = “a fio(0) = — 4atyq D Arr = 
k 
161NYaAiP(Aa)P(— Aa) 
Al 7 (Aa)? (— Aa) E ce Za)P(Aa)] 


The last result is obtained by a further application of the Lagrange inter- 
polation (see Appendix IV). Equation (55) might also be used to change the 
normalization to unit current in ¢-direction. 


9. — Special cases. 
a) No absorption: ya= 0. 


For no absorption the quantities c=1—y,—>1 and a=3y,(1—y,)=0 
whatever finite value y, may have. The first two equations of the system (3) 
show that in the limit y, = 0 


(Z—1)/2 


(56) fol) = 15 +S mel, 


1 
O's 


(57) fio(S) = 


This means that the asymptotic part of f,.(¢) degenerates into a straight line 
and that f,)(¢) is a constant and does not contain the transients s. The f,(C) 
for 1 > 2 contain instead only transients and no asymptotic part. The square 
of the largest root of the characteristic equation (12) may be estimated to 


1 + (4/5)ya + (108/175) yey: 


58 Rw 
( ) 0 BvaVi 


in the general case for small y, and 1—y,. In the limit y, — 0 and arbitrary 
y. this expression goes to infinity, or precisely 


(59) 3yaytÀs > 1. 


Also À = icJa goes to infinity À >to: 


G ga = SPL Vs) a2 TA LE 
(o Le Vi e 


516 


W. KOFINK 


A development of 6(A) in a power series of 1/2 shows 


60) (4) = M — A+ LM — MM + MA (ME, — Ay) + dA — A] + 


il 
+R (a) Ps 
where 
(L+1)/2 (Z+1)/2 (Z+1)/2 
M, = > 4, M, = > Hike» i= > Mill y+ 
j=1 ik KI 
i<k Î<k<l 
(61) 
(Z—1)/2 (£-1)/2 (£- 10/2 
A, =: » Ass A, D > A4, À; ay > Al 3 ace 
“Bt Be? ae 


are the primitive symmetrical functions of the roots uw; and 4,. For |7|= co 
only the 1“ term of the series (60) remains; hence we have in the special 
case ya=0, A, +00, À, + 100 


. (62) 0(Ao) = Ò(4a) > M, — À,, 


r(Ag)t(— Ao) a = J n Vi 
r(A0)pP(— Ao) + 7(— A)p(Ao) 2(M — A) 2M_-A) 


(63) 


The two extrapolated distances become equal: 6 = A (M, — 4)/y:; the co- 
efficients x; and « diverge, but in the development of the first term of fog(é) 


a, EXP [— 6/20] + % exp [C/Ao] = 
Lx — XK 


ho 


Lg — Xo 


LL 
C +5 Ge (Oot a +. + 1— 


= Ag + Lo — 


only the sums «,+«,=1 and (x, — a%)/Ay > ye/( M, — A,) =1/6 play a role. In 
this way one gets systematically equation (56) again 


foo(S) > Te a 2 os exp[¢/A,] =1 TT + 2% exp [6/4]. 
For 


fo(6) = AURA Ay exp [— C/A] — xodo exp [C/Ao]} ST 3Ya >. aA, EXP [él] = 


3 
= {Sb — 0) — Bab + 37° (as — a)? + N — Bra Z aA, exp [JA] 
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only the first term remains after the limiting process 


Sy At di == 
hot) > Va a Ra, Mie 


which is equation (57) again. The asymptotic part plays no role in the higher 
fws,; but the coefficients x, of the transients behave regularly since 


. p(As) 
(M, — A;)Asr'(2,) | 


(64) CA 


The characteristic equation (12) is now only of the degree L —1, because 
2 roots À,, — À, are infinite, and (12) becomes 


(65) PEA) — AsW ,(4s) NI) (= Ais + Aes 2009 ab A g=ifal 


(As in (12), the largest coefficient in this equation vanishes.) One recognizes 
in (65) the independence of the 4, of y;, and (64) shows that the «, are also 
independent ot y,. Only 6=A depends on y,. The same is the case with 
directed fluxes through the boundary. The Gauss quadrature method gives 


pu) 


> 0), 
(M, — A,)r(— p)’ (u > ) 


(66) | f"(0, 7) = 


(the — sign compensates one — sign in p(—u), which contains one (— 1) — 
factor more than r(— )), and the spherical harmonics method 


p(— u)p(u) FED? ‘ll 


(67) Î(0, 1) Fai Me Ai 2 (u — ft3) 1(— 3) p' (Us) i 


b) Isotropic scattering, absorption: ya40, yr=1. (L finite). 


In this case c=1—y, is different from 1, but a=3y,(1—y)=0. The 
root À, of (12) will remain finite À ~ (1+4y,)/3y, (see equation (58)). À, = iVc/a 
however, is still infinite as À, ico.: Hence one obtains by equation (62) 
6(Az) = M, —A,. The other quantities become 


1(Ag)t(— Ao) 


a 
=A? 0 N Noy = ’ 
poe? 1 rp Ao) + rt Any PA) 
eno) pie Dida) 2Aop (As) 
(68) = N, 7 À) ’ Lo = N, r(Ao) ’ ks = N, A(z Ex 22)r' (As) ’ 


D 
Soups A= Ateo Me (s = 1, 2, | 
0 
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The directed flux at the boundary according to the Gauss quadrature method is 


a _& pu) 
(69) f'(0, u) = 2Nc 72 à 7 1) 


, (u > 0), 


and according to the spherical harmonies method 


(Z+1)/2 il 


(70) f(0, u) = 2N, cA p(— u)p(u) Ep) pr Bs) (14) | 


APPENDIX I 


Caleulation of the sum over / in equation (15). 


With Wi, = P,Q.— 4: we have ({) 


À GE + DIA + PC A) + Ale + aa AIP) = 


= 2 (21 + ILL + ax + Ac + Ag) Qo(— Ax) |Pi(— ax) P(e) — 
— Ax(e + ak)Q 1 (— h)Pi(u)} = 


— [(+ak + Ax(0+@48)Q(—A))(1— 3A) +Ax(c+ adi) (Qo(— Ax) +31 (— Ax))] = 


S PET) PER LE 
= 21 1){...}) — 1 3 six is Pi itl f 0 , 1 by, Vas 
(> ( + 1)}{...}) + 3y.Au — ad? with RR 


Hence, we obtain for the whole curly bracket in equation (15) 


(Al) 1— Spa + Ÿ (+1) €.) = 


= — ah, + [1 + adi + Aie + ah) À)] x (22 + 1)P.(— Ax) P(t) = 
1=0 


Lo ait) » (21 + 1)Q(— Ax) Pil) . 


Now we use the Christoffel formulae (see ERDÉLYI: Higher Transcendental 
Punctions, p. 162, eqs. (20) and (31)) to calculate the two sums 


L 


| vy 
> (2) + YP 4)P(1) = ath. PAPER) = PAN 


1=0 
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and 


> (1+1)0:(-2)P.(1) = 


it 
ae {(L- DIP, (M)Qz(— An) —P;(0)Q 2-41 (—Ax)]—1} : 


Putting these values in equation (A.1) and replacing the Q’s with the aid of 
the W’s one gets 


up; a, Arle + a) 
3 ire 3 sy 21 il} = D SR 
li, 
1 à 
es {[(1 + al) P;(— Ax) + Arle + ade) W 7-1 (— Ax) ]}Pz4: (4) Da 
—[(1+ ali)P,4:(— Ax) + Axe + ax) W (— Ax) )}P.(u )}. 


The coefficient of P,(u) in the last term vanishes because it is the charac- 


a 


teristic equation (12) for À =—A,. One may replace 1 + ad? in the coef- 
ficient of P,:(u) by its value in the characteristic equation 
ì A 
1+ adi = — Ax(6 + adj) Le) 
P;u(— Ax)’ 


and one obtains 
Z 2 
1 — 3y Aye + > (2 + 1){...} = — adi J Zx(0 + aÀ,) 
1—=2 


HU + À è 


Ale +422) Poli) 
Bk AA) 


The last bracket in this expression is a const. = 1/(L + 1). 
Hence we get finally for the expression 


DL + A)LPi(— Ai) Wx(- da) — Prat Ax) Wa (- 4a). 


(A2) 1—Byedye+ À (21 + DICI + at) PA da) + Alcea) Win A] 210) = 


2 Axle + ahi) | Pry(M) |= 
== == (le = = 
Mur Bi Can 
À if 2 P,A(u) | 
E= — À Ue n 
pu + À È Bieta RSA) di Ax) 


For odd L is P,:,(— 4) = Pi (A), therefore the choice of the sign of 4 ir 
arbitrary in P,:,(4). 


APPENDIX II 


Calculation of the coefficients «,. 


In the following the solution of the equations (32) 


; (Z—1)/2 ij e + ai es TESE 
(2) 10, p) = À mal an + TT) — 0 enr pare | 


520 W. KOFINK 


will be carried out in 2 ways, (a) a straightforward way and (b) a shorter way. 
Both ways show different interesting features. 


(a) The straightforward way. 
We take equation (32) for à and j and subtract f(0, — u;) — f(0, — u;) = 0 
for two arbitrary i, j = 1, 2, ... (L+1)/2, (i Æj). Then we get rid of the term 
— ad, in the curly bracket of (32): 


(A.3) » de + ah) | — = 
Ie Ie 


È axÂx(e + a?) 
= (U; — M;) 2 (io Li)(Ax — Hi) 


=0 (à Æ)). 


Originally we have (Z+3)/2 unknown coefficients x, %o, 1, ... %z-n725 this 
number is reduced by 1, however, in view of our condition of normalization 
x + a, = 1. By the subtraction above one loses one linear independent equation, 
which may be represented by the sum of the eq. (32) over j = 1, 2, ... (L+1)/2: 


(&+1)/2 GG 
(A.4) SD ees art a he 
j=1 ki \ Ay — ui) 
With the introduction of the polynomial 
(Z+1)/2 
p(2) = II (A— ps); 
j=1 

its logarithmic derivative 

p'(4) (Z+1)/2 Ji 

PURE À— p;’ 
and new variables b, for equation (A.3) 
A.5 PINETA = L-1 

_ (A.5) xl + ad) = dxP(Ax) , Roi a page Ne DE tie à 
one may write both equations 
(Z+1)/2 
(A.6) ST une, (si=na. Til; ing), 
Ù =1 

and 

L +1 4 
(A.7) > «Ax | — + ai, (0-43) p'(Ax) 2 

k 2 P(Ax) 


Let us consider first the set of 4((L+1)/2) -((L — 1)/2) equations (A.6). They 
are equivalent to (L — 1)/2 linear independent equations only; one recognizes 
this fact again by the following process. A power development of the product 


(Z+1)/2 


Il ui (y), 


whe ol 
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in powers of A, yields 


(A.8) 2 D, 2-98 ee Pt ua) dA 0-92 4 nt paptr) dx A EP? La 


u<l' 


— + — +... + (— 1)2-2(T]°** w)b:} = De 
1 


A sufficient way (*) to satisfy these equations for every choice of è, j is to 
put to zero every term in the power development 


(A.9) > bid = 0, (m Sen, 


Then we have a simpler system of (L—1)/2 linear independent equations 
again for the (L — 1)/2 unknown variables b,, di, ... b;_1,», if we regard the 
variables 6b; and b, as known quantities just for a moment: 


(Z—1)/2 
(A.10) SE th ae bay [m= 0,1). 


s=1 


L—3 
ME . 
(Z—1)/2 (Z—1)/2 


The solution of this system is with r(1)— [] (A—A,) and r'(4,) = [[°*° (4, —2,): 


j=1 j=1 


1 fr(ag)bs (Ao) bo | atei L—1 
(A.11) , b, = i! (As) lee are I rs Za) , | ca ils “5 rai | 5 
or remembering À, = — À, and the relation (A.5) between the os and b,s 
Aso 4445) s) f _ or(2o) at A) À 
A.12 , : 
ORE et) vie sana] 


This equation relates the coefficients of the transients x, to the coefficients x; 
and a, of the asymptotic part. Putting these values of «, in equation (A.7) 
we obtain a relation between «; and a; this relation together with the nor- 
malization condition x; +%=1 determines «; and x, completely. To solve 
equation (A.7) we separate the two terms in the bracket and sum over 


i SS E a > ax and 2) > OpAn(e + adi) io 


with the aid of the slightly extended Lagrange interpolation formula 


f(A;) 


Gru A 


(A.13) H(A) = RA) X 


if f(A) and R(A) are polynomials of the same degree in 1. 7, is a constant to 


be determined. 


(*) See also: Ky Fan and J. J. ANDREWs: ORNL-Report 2485. 


ral 
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(1) We consider first, using (A12) 
(A.14) a > a, Az = a(x + x) A0 + @ 2 a À = a + a 2 ast = 


Ace + pot I SL (As + Ao)ctot(A0)P(— Ao) — (As — 20) 1(— Ao) P (Ao) A. (As) 
P(Ay)p(— (A? — À5)(4 + c/a)r'(4,) 
If one chooses 


f(A) = [(A + A) otor(Ag)P(— Ao) — (A — A) 7(— 4o)P(40)]AP(A) 


= al + 


and 
RA) = (APN — 2r(a) with 2-%=2+î, Li VE. 


a 


both polynomials f(A) and R(A) are of the same degree (ZL+5)/2 in À There- 
fore one has to find 7, in the interpolation formula (A.13) by comparison of 
the coefficients of the largest power of A; it is 


To = Lo (Ap) P(— Ag) — da 7(— 0) P (A0) - 
Then a straightforward application of (A.13) at the points 


À — (— has has Aes dis eeey À Die) ; 


yields 
(A.15) a » Ay = ars ta zl © + GAG) (cor (2o)P(— Ao) — y 7(— Ao)P(40)) + 
+ = Lo — Aa) to (0)P(— A) + (Apt Aa) T(— A0)P(40)] P(— Aa) 4 
2 0 0 al“o 0 0 r(— Ta 


a PA 
+ 5 U(A0 + Aa)etor(Ao)P(— À) + (do — Aa) 7(— Aa)P (Ao) a È 


(2) We consider secondly 


p'(A) 
p(Ax) 


= Ao + 028) | PU) _ = 
AU TE ES rc 


' A.) 
© aA, C À? P : È 
> SE P(Às) 


(A.16) > cd (e + ad2) © 


Putting x, from equation (A.12) in the last sum one obtains 


(A.17) Yale + 428) Fe = 


al (c + ax) C(As + 2) (Ay) p(— ho) — (As — Ay) %57(— Ao) (Ao) \p' (As) 
P(Ao)p(— do) È (A; — 40) (4a) À 
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To this expression one has to apply the same kind of interpolation for- 
mula (A.13) with 


f(4)= [(A + Ao)tot (Ag) p(— Ay) — (A — Ay) x r(— Ay) P(Ao) p'(A) , 
and 
R(A) = (A— A,)r(A) , 


at the points A; = (4, 41; «+1 A1»). Both polynomials f(A) and R(A) are 
again of the same degree (L41) )/2 in A. The coefficient of we largest power 
A-102 of p'(A) is (L+1)/2; hence we find by comparison of the largest powers 
of f(A) and R(A) 


(a 7"(Ao)p(— Ao) — a r(— Ao)P(Ao)) . 
A straightforward application of the interpolation formula gives now 


po )aeor(Ay)p(— Ao) — (As — Ay)ag r(— Ay) \p'(A 
(A= Aa) (As) 


(A.18) 


= (ar (lo) (— Ay) — re Ay) P(Ao)) ci Xi P(Ao)P'(— Ao) — %0P(— Ao)P'(4o) - 


Putting the constituents together in > all terms with p'(4,) and p'(— À,) cancel; 
k 
one obtains the simple result for the second sum 


a PD’ (Ax) 
(A.19) 2: a, Axle + adi) ua 
I41 A(6 + asa) 

2 Pip À) 

Both sums (1) and (2) together yield now for (A.7) 
PET gl L + 1)ai, 

20) Soa — ah + (ot oll) PACCO 
*{[(Ap — Aa)%o7 (Ag) P(— Ao) + (Ay + la) — Ao) P (Ao) I7(Aa)P(— Aa) + 


+ [Ao + Aa)eor(A)P(— Ao) + (Ao — Aa)egr(— Ao) (A) — Aa)P(Aa)} = 0. 


[o%r(Ao)p(— Ao) — a r(— Ao)P(Ao)] » 


For a #0 the curly bracket {.. ss = 0 gives now the required relation between 
a; and x. In the case of Vile 1, a = 0 we have to split a in two factors Va, 
because lim À Va > 1 + $Ya +... remains finite and different from zero, and 


to nt at the curly bracket with a factor Va also for compensation of the 
denominator in À, = 7,/¢/a. Correct in all cases is the quotient 


non "pt à) Ae A) (Aa)p(- À) + (Ay + Ale Apa) _ 

TESA) Hu as AYTE ‘= 
Aa) fe) Aa Pha) — de Pr (do) 

LS A) Jo Bo (Ag) + À Fi) 
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with the notation 
(A.22) F(A) = r(4)p(— 4) + r(— A)p(A) . 
Finally we use the normalization condition « + % =1 or 1/% = 1 + a%/% 
to obtain 

Aa FA) * Ao Fs(Aq) 


Be). hn TT Ch) Fe II Ream tees 
and 
(A.24) ot de r(40)P(— do) « 


© Aa Bs (Ao) P- (Aa) — Ap P_(40) F4 (Aa) 


The introduction of the function 


Y ; r 
(54) (0) = 4 ot a) 1 AP) FU) 


has the advantage of a simple behaviour for |7|+ co, as was shown in equa- 
tion (60). It leads to a finite limit 


(L+ 1/2 (Z—1)/2 


lim 6(4) = M—A = Y u;— D> As; 


|A|->9o s=1 


or 


lim 6(A,) = lim o(i Ve) = M,—A, also. 


a—>0 a—>0 


Finally this way of caiculation of the coefficients x, shows by equation (A.9) 
that 


e+ ai Ls 
beim a 
2 vita 2% p(Ax) Aid 


or for any arbitrary polynomial f(A) of degree < (L — 3)/2 in 7 


Art = 0 for. m =:0,1, :.. 


(e + az) f(Ax) 
A.25 Ay z= . 
ee) Za D(x) È 


Examples for such f(1)'s are p"(A), r'(À) ete. 

A further result is contained in equation (A.15), if one considers that the 
sum of the 2nd and 3rd term of (A.15) vanishes in consequence of equation 
(A.20), namely the simple value for the sum 


(A.26) D nae A,(¢ + ao) (or (A5)P(— Ao) — ay (— Ay)P (Ao) 
k ap(Ao)p(— Ag) 


— ong: (1 + £28) a , 
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after introduction of «7 (equation (37)) and a, (equation (38)). Ÿ «Aj is 
k 
related to the second moment of the directed flux at the boundary. 


(b) The shorter way. 


A short calculation of the coefficients «, uses the Lagrange interpolation 
formula as a generalization of a method, which was used first by WIicK(5). 
To this end we try to reconstruct formula (32) by a convenient interpolation, 
which allows one to read off the coefficients directly by comparison of the 
constructed equation with equation (32). To do that, we represent the poly- 
nomial f(A) = (C+DA)p(4) of the degree (L+3)/2 in A twice — first by its 
values at the (L+5)/2 points 2; = (— Aa; Aa, Ao, A1 ++ Âw-wn) using the poly- 
nomial R,(A) = (42 — À5)(1 — À,)r(2) of the degree (L+5)/2 in À in the La- 
grange interpolation formula 


À; 
10) = RO ET 


where A; runs over the series of points mentioned above and with R(A) = R,(A); 
secondly we repeat this process replacing the point A, by — 1, without a change 
of the other points. This means, we use this time R(A) = R,(A) = (A? — 2°): 
-(A +A,)r(A) in the Lagrange formula. 


(1) R(A) = R,(A) yields (C, D are constants) 


(C+ DIP) _ 
(E — 284 Ar) 
(Dp de ECOLE DAL 
RTL Er Ae) A AAA — Aer (As) 
(C+ DA) HR" (C + DA,)p(A,) 
Ge br) AAA 00, — Ar)’ 


+ 


(2) R(A) = R,(A) yields 


(C + DA)p(A) 


à GNA + AA) 
(C == DA )P(— ha) (C 4 Di) p (Aa) 
TATA ZA Ag Ar Aa) (A Aa)2Aa(Aa + Ao) (Aa) 


(0 — Ddy)p(— Ao) ERP (C + DA)p(à;) 
(A + Ay)(A2— Ar Ag) #1 (A — 2) 42)(4s +-40)1" (As) 


Both formulae differ only in the sign of À. | 
Now we choose the following special values in both equations: AUS, 
for which p(A) is p(4;) = 0 and the left sides of both equations vanish; further 


(8) G. C. Wick: Zeits. f. Phys., 120, 202 (1943). 
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we put C = e and D = au;. Then we multiply the first equation by a con- 
stant A, the second by a constant B and add both together. We obtain 


, P(— Aa) edi à  P() CH OAM; | 
Pri LL ee 
GR — A sata) pla) + Ags _ 
s=1 = Ao À se Ae (45 a. A5)r' (As) As KE Bs 
= {Cae ae pe + 
+ 70)2Aat( Aa) Vo + Ao Aa — Ao 
(o + afi )P (Ae) ( das | 
(U; — Àa)2 447 (Aa) he LE Ao À + Ao f x 


We compare this equation with equation (32), which after separation of 
the terms with the indices 0 and 0 may be written in the following form 


© — aol; 


C+ ad; TP... e+ ad; 
Sap ANI Sii dina) 
o at Bs es > 


(32) a Ao Ag — Mi atl Sosta. À — bs a 


+ oo 


7 


i L +1 
(i Sea >) 


and we recognize that they would be comparable, if the right side of (A.29) 
could be put to zero. This, however, is possible, because the constants A and B 
are still arbitrary constants and because it turns out that A and B can be 
chosen independent of j. This will be shown now: the right side of (A.29) 
may be written 


+ 


PEA | (¢ — alu;)p(— Aa) (¢ + alu;)p(A) | 
244 Wii + Aa)(Aa + Ao)t(— Aa) | (3 — Aa)(Aa— A) (Za)) 


B {_ (¢—4aA,u;)p(— À) (€ + @Aatt;)P(Aa) 


DAs’ it Are i API 


a 


bar PIE] aN, 2)r(Aa)r(— Aa) {AT (4 y Aa) (Aa) p(— Aa) x (Ao + Za)r(— Aa) P(Aa)] en 


rae) [(Ao + Aa)t(Aa)p(— Aa) a (Ao = Aa)r(— Aa)p(Aa) ]} € 


That 4; is only contained in the curly brackets on the left side of the equation 
above as a factor c + au; = a(uj— À?) in this formula one owes to the fact 
that ¢ + adi =e + a(—c/a) = 0, by reason of which the originally occurring 
terms (¢ + alu; vanish. Hence A and B do not depend on the special u; chosen 
on the left side of (A.29), and the right side of equation (A.29) will vanish for 


(A.30) A = GI(A0 + Aa)t(Aa)P(— Aa) + (Ao — Za)t(— Aa)P(Aa)]; 


(A.31) B = @[(dg — Aa)r(Aa)p(— Aa) + (Ao + Aa)r(— Aa)p (Aa) ] ; 
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with an arbitrary constant G, which has to be determined later from our sup- 
position of normalization «7 + % = 1. 
After that the equations (32) and (A.29) are in form suitable for compa- 
rison, which shows 
Bp(— 2 — Ap(À 
CAR Bee p{ Lo) L %oÂo = A 0) 


RA 


A B P(Às) 
FA = ul Tia A?) 


(A.32) 


One may use the first two equations to remove A and B from the last, where- 
upon one obtains a sometimes useful short representation of the coefficients x, 
of the transients in terms of «> and a (again (A.12). 


Ao(6 + as) p(As) CA AU) as (— Ao) | 
Al Le ) 
Bellani Ai 0] 


L—1 
(see. 


1 


From the normalization condition 


Br(A,)p(— Ao) — Ar(— Ay) P(Ao) 
of — Aa) (A0) (— Ao) 


(A.33) ae + do = = 


one may find now the constant G, using the abbreviations (A.22) 


Ao(40 — a) (Aa) (— Ao) 


(A.34) SRI) ATA) 


We put this expression for G in the equations for the constants A, B 


SA A fom A Ae Peay oe Ar Ba) 


(A.35) A= PQ) Fula DAME 
Le A2)r(Ap)r(— AA F+ (À ne Aa F_(Aa)] 
(A.36) UR P_(40) Ps (Aa) — fa F0) FU) 


and in the equations for the coefficients a>, %, %s 


a [Aa PAX) Ts o P.( FAG) r(4o — Ay) 
SOTA FA) PA A Fs Pi 

= [Aa F_(Aa) = Ao F4( Are p(Ao) 
UA di 3 


TA, P(e) + À Faro do) 0 + 405 240P(A.) 
Gia = (Ao) F(A.) — dy P (40) Bala) € + a2 (A MA (A)? 
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and finally in the equation for the linear extrapolation distance 


| 5e 3h y, 28-(0)F4(%0) — Aa Piso) F(a) 
a Data Do Eole) EAU 


After that the solution is complete; the corresponding quantities are equal 
to those of the equations (37), (38), (39), if one remembers the abbreviations 
(34), (35), and (36). 


APPENDIX III 


Summation of the C-W part j'(0,u). 


(1) The sum, which has to be carried out is 


(Z—1)/2 2 
(17) F0, 4) = > aad |— ads + GET 
0.1... k 


k=0 
The first part of this sum was found already (see equation (A.26)) 


Aole + ads) 


pp A Lee 20) (a) — cao) da 


(4.39)  — a Dod? = 
k 


The second part is 


+ al SDA To La 
a As = 
Ata as ae As +u 


e+ las e+ ax 
A.40 Re = 
( ) 2 LAVA po i x ho À u 


By introduction of the values of «, from equation (A.12) in the last sum 
one obtains 


(Z—1)/2 e + a? 


CATA, pg 
fra A,(e + ads) au F(A, + A) %o? (Ay) P(— Ap) — (As — A )aor(— Ao) P (Ao) IP (4) 
P(Ao)p(— Ao) sal (A, + u)(Ai re Ao) (As) | 


Numerator and denominator in the sum are both of degree (L+3)/2 in 4,: 
An application of the extended Lagrange interpolation formula (A.13) with 


f(A) = [(4 + Ay) &or(Ao)p(— Ao) — (A — lo) (— Ao)P(Ao)]P(A4) , 
and 
R(A) = (4 — Ap)r(A) , 
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at the (L+3)/2 points 2; = (— À, Ag, Ai,» Au») requires a comparison of 
the largest powers +2 on both sides of the equation to find 


To = x(A)P(— Ay) = apt 2o)D (0) : 


Then a straightforward application of the interpolation formula with À = — IT) 
yields 
[A+ Aa) — (A — Anjou 0) PA) _ 
8 (A, = Bas vii À5)r' (As) 
as x 
= (ho) DP(— Ao) — ar — A) (A0) + PP do) | i 7) ; 


I Lu x 2o)%07 (Ay) p(— Ao) as (u on A )agr(— Ao) (Ao) 1p (— 4) 
| (u? — Ag)r(— 1) 


Hence, one obtains by addition of all parts of the sum (17) the Gauss qua- 
drature part of the directed flux on the boundary: 


| on — A6 +425) (At) p(Ao)ag+ (on) (A) A) D(— 11) 
SLR P(Ao)P(— Ao) Ao pa r(— n) 


Introduction of x; and x, from equations (A.23, A.24) completes the expression 


ZAg(e + adoyr(Ag)r(— Ao) Aa P_(Aa) — pt F(a) (— 4) 


ee EO) LEQ) de rea) 


Finally the use of 6(A) equation (37), 2, equation (38) and N equation (39) 
leads to the form of f'(0,u) in equation (41), because 


+ F_(Aa) _ U Fs (Aa) = AG BAe È ie 210%.) ’ 


and N may be represented also in the following manner 


r(Ag)t(— Ap) Aa F (A) 
eS) Cert 


(2) A shorter way to find f'(0, «) is connected again with the super- 
position of A times equation (A.27) + B times equation (A.28) and the choice 


of © =c and D —— au in the resulting expression. One gets by this pro- 
cedure the formula 
vecia P | _ y A = Mott) (Ao) 
Cru) (i TE =i Go + 108 — 23e) 
ti (C+ alu)p(— Ao) È (¢ — aAu)p(As) | dolo Bb | 
(Ao — (Ai — A8) (— Ap) À (As +) — Aa) (Ae) Vo À gt Ac!’ 


34 - Supplemento al Nuovo Cimento. 
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where À and B have the values of equations (A.30), (A.31). In consequence 
of (A.32) 


LPC) — Ap() 
ssd alito UE SUR? 
Oy hig = TARE aA) 

A B p(s) 
À; © |a ae ae 


Io di LÉ ea eee CAE 


we have on the right side just 


which is f(0,u). Hence, again 


B A |p 
(442) PO =] FE nie 
BARC + ali) Fa(Aa)t — Aa F (à) PW) 
i Ag (A) Pad) — AP +(4) F(A N 38 we r(— 4)" 


APPENDIX IV 


Calculation of the current at the boundary. 
To calculate 


(Z —1)/2 (L—1)/2 


(A.44) xx = — doti + loto + D aA, = — (x — do) À + D aA, 


Km 0,0; Lie s=1 s 


we use equation (A.12), which gives the xs in terms of «; and a: 
Then we have 


(A.45) Dondi = 


_ Rolo + Ad) (Ao + As)ocor(Ao)P(— Ao) + (Ao — 2% (— Ag)D(Ao) |p(As) 
ap (Ao)P(— Ay) 4 (As — Ag)(As — Aar’(As) 


We apply the Lagrange interpolation formula in the simple form (44) to a 
function 


F(A) = L(A + Ao)ecot(A0)P(— Ao) + (Ap — A)r(— Ao)P(40)]P (7) of the degree ae : 
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in A with an aid function R(A)= (A — ,)(4? — )r(2) of degree (L+5)/2 in À 
at its (L+5)/2 ZeTOS Aj= oa hay hay hos Ai; sie) Axe). Then we obtain 


[(Ao +4) )æor (2 0) Pp(— ; A.) 4 (Ao iF or (- 2o)P (Ao) ]P (2) per 


__TlAo — Aa)otar( Ao) (— 45) + (Ag — Js 
(A SE 1124400 a) (— Ag) 
Fa Lo + Aa) oor (Ay) p( - À) t (Ay = Aa)aer(— 0) (40) IP (Aa) | 
(A — Aa)ZAa (A — Aa) (Aa) 
i ZAoxoP(Ao)P(— do) | y EU + As)etot(Ao)P(— Ao) + (Ao — As)asr(— Ay) JP (A. 
CATA (A — À) (A— A,)(As — Ay) (42 — 22)r "= 
Putting À = — 7, we find the sum >, required above: 
| [(A + A5) (20)P(— Ap) + (Ay — At r(— Ay) \p(A 
A.46 ee re = 
ADI CE DC — Apr (Aa) 
_ P(Ao)P(— Ao) lar Lo + Ac) (Ao) P(— Ao) + (Ao Aa)t37(—A0)P(Ao)]P (Aa) 
Aa — 1. 4 : Aa (A5 — 22)r ae 


The last term on the right side is found by an application of the relation (A.21) 


between «- and x, 
Putting (A.45) with (A.46) in equation (A.44), the first two terms cancel. 


We obtain for the sum considered in (A.44) 


Sa ho (Ay ss a) (o)P(— Ay) &o aie (Ay — Aa)? (= Ao)P(Ao)%5 P(Aa 


= e PAPE À) rl 


x=0,0,1,... 


After introducing « and « (see equations (A.23), (A.24)) we get 


(&=1/2 257 (A) (— Ao) P (Aa)[ F(A) + F-(Aa)] = 
ee TER EE Bald) PRESSE ATA] 


_ 2AN Pc) Fulda) + FU) pe ” 
“ht le Same Bea) [with N of eq. (A.43)] = 


4ANAP(Aa)P(— Aa) a 
TAGE (As) 2a) Er.) pe)" 


In the last equation the abbreviations #,(1) used in Appendix II were 
replaced by their definitions, see equation (A.22). The last form of equa- 
tion (A.47) was applied to the evaluation of the formula (55) for the current 


at the boundary 2 = 0. 
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II. - Behavior of the P,- solution of the Milne problem with anisotropic 


scattering for L > co. (Algebraic part). 


In Part I it was shown that the partial solutions of the Chandrasekhar- 
Wick method which contain the factor g(u) cf. Eq. (I-27) (°), dependent on 
the directions “= cos Ÿ, are solutions of the non-truncated Boltzmann integro- 
differential equation (I.1). An error occurs only because the Gauss quadrature 
method is used to evaluate approximately the integrals on the right side 
of (I-1). The integrands are not polynomials but are meromorphic functions 
of u; hence, the integral is evaluated only approximately. This error is mani- 
fested in L-approximation as an error in the characteristic equation, which is 
an algebraic equation instead of a transcendental. But the sequence of al- 
gebraic characteristic equations converges for L — co to the exact transcen- 
dental characteristic equation. Therefore one may expect that the solutions 
of the C-W method tend to the correct Wiener-Hopf solution for L + co. 

The solutions (1-16) of the spherical harmonies method, however, can be 
used for a correct integration of the integrals on the right side of Eq. (I-1). 
They are continuous functions in the whole interval —1<4u<1 and their 
u-dependence is of such a kind to satisfy a truncated system of differential 
equations (compare Sect. 1 in I) instead of equation (I-1). The approximate 
characteristic equation of the S-H method coincides with that of the C-W 
method, but the truncation of the infinite system of differential equations (1-3) 
introduces a further, and more serious, error. It is shown in the following 
Sect. 1 that the spherical harmonics method introduces an «error-source » in 
the Boltzmann equation. It turns out that this error-source is important 
especially at those places where the contribution of the transients to the so- 
lution is large. In the Milne problem this region is near the boundary of the 
medium. The error-source does not show a tendency to vanish in the limit 
L—co in this case, indicating a lack of convergence of the spherical har- 
monics method. In a denumerably infinite set of directions, however, the 
spherical harmonics solution coincides with the C-W solution in the limit 
I —-co and the error-source is zero there also. Hence it is possible to devsie 
a scheme for passing to the limit by using only the C-W part of the S-H so- 
lution. 


(*) The numeral I in the number of an equation refers to the corresponding equation 
in the first part of the paper. 


ess, 
a 
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Although the convergence of the angular distribution f(£, 4) given by the 
spherical harmonics method presents serious difficulties, it will be seen that 
the convergence of the total flux f,,(£) shows no such difficulties (1°). Thus for 
applications to reactors—for example, in the calculation of thermal utiliza- 
tion—where the spherical harmonics method is used really to compute f,,(¢), 
not f(¢, u), subtle questions of convergence do not arise: the spherical har- 
monics method, used in the usual straightforward way, does converge to the 
proper answer in the limit L —co. The performance of this limit-process, 
however, will be published in a later paper; it is partially carried out in the 
author’s ORNL-report No 2358. 


1. — The error-source in the Boltzmann equation caused by spherical har- 
monies solutions. 


If one puts the solution f(ê, u) (1-16) of the spherical harmonics method 
in P,-approximation in the Boltzmann integro-differential equation (I-1), one 
obtains a remainder 


+1 +1 
0 ; ed C DE ar I TL 4 TPE ! 
MS RE, u) = TE | je 5 Je wa Sg es fre nera = 
(L=1))2 f c + ax P;n(u) | | 
x à | Ae PRÉ REP ATA + auÂxe , 
2 expt tia) yu | sn ig ea | RS 


using the integrals (I-51, 54). In (1) all terms which do not contain spherical 
harmonics cancel one another. Hence, the remainder is 


a Pin(u) no 
@Q) Rp) = Za exp [EI + adi) pa) = 
fa P;n(u) 5 2 P4(0) = 
ue trs 
on Prn(u) 11 
(3) LA ACE 


The last equation was found by comparison of the > with equation (I-18) 
k 


(°) For further information and illustration see the section « Convergence of the 
Spherical Harmonics Method » in Chapter IX of A. M. WEINBERG and E. P. WIGNER: 
Theory of Neutron Chain Reactions, Preliminary Draft, 12-9-57-Revision, Oak Ridge 
Nat. Lab., Oak Ridge, Tenn., USA. 
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for u=0. Therefore, the remainder is closely related to f"(e, 0) i.e., to the 
difference between the spherical harmonies and Gauss quadrature method 
solutions for u=--0, Instead of fulfilling the Boltzmann equation (I-1) 
exactly, f(z, u) = f'(e,) +f"(2, 4) satisfies the equation 


+1 ET 
Li : ee I 3 y MIE; Pits P;n(u) I/ 
To (yu) ff; 1 )du'+3(1-yaulf(î, Hu du + P,u(0) | (2,0), 
4 > 


1 


with a «27*'-pole-source », the strength of which depends on the part f(z, 0) 
of the directed flux perpendicular to the z-axis in every ¢-plane. 

This result is valid also for every partial solution contained in equation 
(I-16) with a corresponding partial remainder contained in equation (4). 
Therefore it is not a special feature of the Milne problem, but a common 
feature of all spherical harmonics solutions. The remainders of the different 
partial solutions, however, may accumulate or destroy one another. One re- 
cognizes that the true Boltzmann equation can be satisfied for arbitrary 
u(—1<u<1) only for problems, in which the boundary conditions are of 
such a kind that f(z, 0) tends to zero in the limit L + co as 1/L* with «> 4. 

This will presumably not happen in the Milne problem, because in this case 
f(z, 0) has just the task of joining smoothly the directed fluxes of medium 
and vacuum side at the point of discontinuity “= 0; hence one cannot expect 
that Jim f"(z,0)=0. On the boundary between medium and vacuum we ex- 
pect lim f"(0, 0) = — 43 lim f'(0, 0), if the spherical harmonics series tends to 4 
of the discontinuity of the exact Wiener-Hopf solution. Normalized to unit 
density at the boundary the exact values are f(0, u=+0)=3, f(0, u = —0)=0 
with a discontinuity of 4 in the case of isotropic scattering without absorption. 
The spherical harmonics method yields with the same normalization in L = 3 
approximation f(0,0)=0.3737, in L=5 approximation f(0, 0) = 0.3382, in 
L=15 approximation (*) f(0, 0) 0.2898. This behavior does not contradict 
to the expectation above, according to which one expects lim f(0, 0) = } or 
lim /"(0, 0) =— 4. 

For the special directions u =p; (j=1, 2, …, (L+1)/2) the error in (4) 
vanishes, because P, ,(u;) —0. From equations (1.16), (1.17), (1.18) we know 
that in these directions f"(2,u;) = 0 and f(2, 4;)= f'(2,;) which is identical 
with the Gauss quadrature solution for u= --u, and is zero for u=—H; 
(this last was our boundary condition for the Milne problem in the spherical 
harmonics method). Therefore we may fulfil the boundary condition and the 


(*) I owe the calculation of the P,;-approximation to Mr. H. $. Moran, Oak Ridge 
National Laboratory. 
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true integro-differential equation in the limit L +0 at a denumerable infinite 
set of directions «= +yu;. In these directions the spherical harmonics- and 
Gauss quadrature-method solutions coincide, in all other intermediate directions 
they differ and the spherical harmonics method leads in general to a non- 
vanishing remainder in (4). Therefore we have to select in the limit L — oo 
only the denumerably infinite set of directions «= u,, in which both methods 
coincide, if we wish to proceed to the exact limiting function of the Wiener- 
Hopt method. 

The following consideration shows the behavior of the remainder in equa- 
tion (4) as function of u at a fixed e. If f"!(e, 0) is finite and different from zero, 
we obtain the quotient of the remainders at the end-points of the u-interval 
[4|=1 and the center u=0, (L+1 even), 


R(C, 1) E Pr4(1) = (— 1)e+v2 2:4:6-8 (LZ +1) 
R(C, 0 PaO) PTE ONCE By 
This quotient is 2.67 for L=3, —3.2 for L=5, 5.09 for L—15. For large 


IL it diverges as W/Z 


(5) ots 
RC, 0) 


= (1) VE Vie 1 


Hence the remainder in (4) increases strongly from the center to the end- 
points of the w-interval. 

The divergence proportional to VL, however, occurs only in the narrowest 
neighborhood of the point w—1. One recognizes this fact if one considers 
the quotient of the remainder at the extrema y” of P,,,(4) and the remainder 
at the center u=0. The abscissae 47 of the extrema, for which P’,, ,(u”)=0 
are approximatively ur, ~ cos ((m++4)/(Z+3))a for large L, where the nume- 
ration m= 1,2, ..., (L+1)/2 begins with m = 1 for the largest abscissa between 
the zeros u, and x of P,,.(4;)=0[u,<wf<,] and ends with wi, ,1,2==008 71/2 
for m=(L+1)/2. The extrema are 


9 


(Z+1) 


Pesa RA \/- [1 al ’ 


for large ZL. Hence the quotient of the remainders at fixed 2 for large L is 


R(C, Um) (+10/2-m[] __ (,#)21-4 


The absolute value of this quotient is independent of L and increases slowly 
with the distance from the center. The divergence proportional to WL 
occurs only in the small interval between the largest zero of P,,,(u), 
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pu = cos [(37/2)/(2L+3)] & 1 — }((377/2)/(2L+3))?<u<1; this interval in the 
neighborhood of u=1 decreases as L~° for large L. 

Hence the spherical harmonics approximation shows two possibilities for 
a lack of convergence. (1) If f"(z,0) does not vanish in the limit L — oo, 
an event, which may be caused by the physical conditions of the problem; 
(2) If f(z, 0) does not vanish strongly enough to reduce the remainder to zero 
also for |u|=1; this is the case, if f"(z,0) does not tend to zero stronger 
than 1/VL for L oo. 

In our case of the Milne problem one observes a quick decrease of the re- 
mainder (3) from the boundary to the interior of the medium ¢<0. To 
recognize this fact we separate the x; and «,-terms from the sum in the re- 
mainder (L+1 even): 


(7) Ru) = — flo + at) 5°10 (as exp [— Eau] + a exp [E/An]) + 


+ SE ci, exp [t/A(e + af) pit 


À lies mostly outside of the real interval (— 1, 1) and is usually large; in this 
region we have for large L 


_ et (as — 1) 


8 Pit 
(8) he V2n(L-+ 1)(42— 1)? 


/ however lies in the interior of the real interval (—1, 1) and here is 


Se Nunc 3 
9 P ‘iad D Ce eae rt it] => eed ( 5) ALC 
(9) z41(u) a Ving sing with @ 7 + na + 5 are COS 4 , 


for large L and 1—e<u<1+e (e small). Therefore the quotient 


Pry(4) | V2sinp 


Prig(Ag) (L= (21 ri) 


tends to zero for L co in this w-interval. The last statement remains 
valid at u = 1 because P,,,(+1)=1 (L+1 even). Hence the first term in (7) 
plays no role in the limit L>co; the remainder contains in the limit L — CO 
only the contributions of the transients (with 0 < À, < 1) 


(10) lim PIG AV ims ey à Pr (4) 
ae Moa) = lim | 2 e expLtiAlie + ads) pe 
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The meaning of this fact is that the spherical harmonics method produces a 
remainder in the angular distribution just in those points, in which one wants 
better information than diffusion theory can give. 
. Introduction of the values (1-39) of a, in (9) leads to 
av exp[6/2][1—(a/6)2, O(a) I] 


if ares f 5 42) 6 2 De = 
(11) lim R(¢, 4) = lim {A (e+ 440)24P(1)p( n) > AC Br À) J 


Because the sign of 7’(A,) alternates with s, the sum in (11) is the difference 
of two infinite series with alternating terms. This renders a general state- 
ment of the behavior of (11) difficult. 


+1 
One may use equation (4) to form the moments (ens pu) du. Because 
—1 


+1 
the remainder is proportional to P,,,(u) and Ju'P,,,(u) du = 0 for O<I<L 
=I 


the relations between the moments of f(£,u), which one could derive on the 
basis of the correct integro-differential equation (I-1), are the same in the 
spherical harmonics P,-approximation up to /= L. One of those relations is 
equation (I-25), it is the case J=0. In general, if we denote the moments 


eh 
by male) = fu’ fC, u)du the correct integral equation (1-1) and equation (4) 
=1 


for P,-approximation in spherical harmonics method give equally the following 
relations connecting the moments 


| as mo(Ë) for 1 even 
0 had 
Be Miss(C) + MAÉ) = 1 

Seer o) m;(£) for 1 odd 


for linearly anisotropic scattering and 0 </< L. 


2. — The functions, which tend for positive and negative y in the limit L — co 
to the exact directed flux. 


We put in this section and index Z on the approximate P,-spherical har- 
monics solutions for the directed flux f,(¢, u) = f,(£, u) + fi (6, 4) (1-16) which sa- 
tisfies the Boltzmann equation (4) with the error source term 


(13) a PR 41,6, = Flew) 
where 


P;n(u) 


+1 5 ; 
(14) FC, u) = 3 (i= Ya) ue, Ll’) du' + 3 (i von fuit u') du' + P,:(0) (C3 0). 


= = 


RURAL A 
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The exact directed flux, which satisfies the Boltzmann equation (1-1), may 
be denoted in this section by f(ê, u) without index: 


(15) fl dt + f(î, u) = F(, 4), 
where 
+1 
(16) F(î, p) = sl fic Co = (Lye [nie ow!) du’. 


i 
If one considers u for the moment as a fixed parameter and solves equations (13) 
and (15) as inhomogeneous linear differential equations, one obtains 


rye | 
UD HE —expl— ta] lou +7 (PC 10) esper} 


for functions with or without index L, differing only by different « constants » 
C(u). The constants C(u) are determined by the boundary conditions. For 
large negative ¢ we have the asymptotic behavior f(£, u) — exp [—¢/A,,,] and 
f,(C, u) ~ exp [— C/A,] respectively, with Z,, and À, > 1 for prec u; hence 
one may use, for positive È the boundary conditions exp [£/u]f(£, 2) > 0 and 
exp [C/wjf,(C, u) > 0 for ¢ +-— co; these conditions determine the I 
C(u) and C,(u), OI UT, This yields in the first case (exact solution) 


— © 


. = Ça Ÿ L Se, Pod À fel 
Jim exp [C/u]F(C, u) = C(u) + 7 |P, pu) exp [C'/u]dc’= 0, 


. 
0 


is 
(18) Cw) == | F(', a) exp [£"p] at! (<<), 
and pi 

(19) fee 2 tel Ala 2, u) exp [Eu] ae! 


In the second case (P,-approximation) the source boundary condition leads to 


0 
il 


(20) Cu) = 7 [Ps Eu exp [Ed ar (<p <a), 


eni) 
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(4 
(21) fia) = ur SH] | (er, 1) exp [Cif] ae’ 


— o 


If we take the difference of the exact and the approximate directed fluxes 
in the limit L — © we obtain for positive u 


A 5 
(22) f(C, 4) — lim fu) = Ren [—¢/u] Lim [LPC 210,10) exp[¢'/u] dc’. 


—_ D 


We now assume that the approximate density and current tend to the exact 
value for L + co; hence the corresponding terms cancel in the integrand on 
the right side of (22) and the error source term in /,(î, u)—compare equa- 
tions (14) and (16)—alone remains: 


: k Pr / er! =) 
(23) 6m) lim AE, 0) = —Texp[ Efe] im pe (uf fic’, expe! p] ae’ = 
1 Ù (Z—1)/2 P ( ) i i il 
= —exp[— C/u] lim RATE DAV PB) exp |& È |a 
u p [ Elu] SS na x( I x) Pal An) P È u + da 6 


rAr i oT Pe 
= exp El Him NC 1) exp El 4) Poly) 


Fe Ore (¢ a5 ai: ) P;n(u) aa" 
= lim DA LD x exp [Z/Ax] FR = lim fr(£ a) n lO MEME 


Evidently one may include the direction u=0 for ¢<0 in formula (23). 
Therefore one finds again that the Gauss quadrature part fi (î, u) tends for po- 
sitive u (forward scattering) to the exact directed flux 


(24) fC, 4) = Jim [f,(6, 2) — FC, u)] = Lima 1G; 2) (0<4<1). 
For negative u (backward scattering) we have different boundary conditions 
for the exact and the approximate functions, which determine the con- 
stants O(u) and C,(u). Im the first case we may use the condition of no 
flux from the vacuum to the medium at the boundary é=0 %.e. f(0, u) =0 
for negative u. Equation (17) yields in this case 


(25) f(0, u) = C{u) = 0 (—1<yu < 0) 
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and we have simply 


exp[—¢ cle) |p 
Peretti 
0 


(26) Ho, 1) = p) exp[£'u) dt’. (—1<u<0). 


For the approximate directed flux, however, this boundary condition holds 
pee for the discrete set of points w——u, (j=1,2,..., (L+1)/2)), where 
f,(0 )—0. In general we have a directed flux f,(0,u) backward from 
the. vacuum to the medium at the boundary €=0. Hence equation (17) 
gives 


(27) C, (4) és f,(0, 4) (—1<yu< 0) 


and the directed flux in L-approximation is 


È 
(28) f,(î, u) = exp[— C/u] ‘0, u) +7 [Rates exp[£'/u]di'|. (—1<yp< 0). 


Taking again the difference between the exact and the approximate directed 
fluxes in the limit L — oo we find for negative u 


(29) fp) — lim f,(6, n) = 
= —exp[—£/] tim { Fifa 2,0) — PC, w)) exp (ae) = 


pee 


= — exp [—¢/i] im {hl0, 1) + mall FE" 0) exp [eq] ac’) 


The last integral is 


È È 
P, 1(4) ia 1 ! er P; 1 i 
Pa, (0) | 10's exp [Lp] dl'=— Dale + at) 2800 | exp GE +2) ero 


Pii(— À) 


0 


xx i L+1 | 
ge Pet) of) 


este Ax Pralt Ax) 
By introducing this in equation (29) we get 
Î(6, 4) — lim f,(¢, 4) = 


hr k L+1 
= fim {— 1,0, p) exp[-c/u] + 5" ee ee) exp [t]aJ-exp[—t] mf 


= lim {— 7.(0, 4) exp [— Eu] — iC, w) + (0, we) exp [— C/u]} = 


= — lim {f:(0, 2) exp[— î/u] + ANC, w)}. (-1<4u<0),) 
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remembering equation (I-18) and f,(0, u) = f,(0, u)+f}(0,u). Hence, we see 
that the exact directed flux f(¢,m) will be, in the limit L + co, the sum of 
two terms, both containing only the first part f, (the Gauss quadrature part) 
of the spherical harmonics solution f, 


(30) f(€, pu) = lim [7,(6, w) — fi(0, 2) exp [—- Eu]  (—1<u<0). 


The second term removes the poles of f,(£,u) at u=-— 4,; this can be seen 
quickly by remembering the expression (I-17) of f}(î, 4). We therefore have 
the following representation of equation (30) for negative u 


G1) (Cy) = lim DO cade |— ada cene 


i ne 


(exp [¢/A,] — exp [— ÉJu]) . 


In equation (30) both È and y are negative, their quotient C/u is always posi- 
tive. In the interior of the medium—excluding the boundary ¢ = 0—the 
term exp [— é/u] vanishes with u — 0. Hence, we have a continuous tran- 
sition to the expression (24) for positive uw. On the boundary ¢ =0 however, 
we have to exclude the direction y = 0 perpendicular to the z-axis from equa- 
tion (31) and we obtain f(0,~)=0 for all «<0. In this way the well-known 
discontinuity arises between the expressions (31) and (24) for u=0 on the 
boundary of the medium $=0. 
ORE 
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SOMMAIRE. — 1. Introduction. — 2. Les solutions principales de l’équation 
de Klein-Gordon. — 3. Les ondes planes généralisées déduites des fonc- 
tions elliptiques de Jacobi. — 4. Les solutions ondes planes des équations 
d'ondes non linéaires []1y+xy+yy?= 0.-5. La composition des fonctions 
d’ondes dans les théories non linéaires. — 6. Solutions invariantes et so- 
lutions radiales des équations précédentes. 


1. — Introduction. 


Je me propose d’exposer ici quelques résultats que j’ai obtenu dans la 
recherche et l’étude des solutions de quelques types d'équations d’ondes non 
linéaires susceptibles de généraliser les équations d’ondes de la mécanique 
ondulatoire. 

De nombreux auteurs ont déjà cherché à introduire des équations d’ondes 
non linéaires en partant soit d’une étude phénoménologique des interactions, 
soit en cherchant une théorie non linéaire dont la théorie quantique des champs 
soit une approximation. 

J'ai adopté un tout autre point de vue en cherchant si des considérations 
très générales pouvaient sinon indiquer exactement, tout au moins restreindre 
les classes d'équations d’ondes non linéaires susceptibles d’être introduites. 
Partant d’une analyse des types de solutions de l’équation d'ondes de Klein- 
Gordon, j'ai été amené [16-18] à en examiner les généralisations accep- 
tables. Réciproquement, ces généralisations satisfont à des équations d'ondes 
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généralisations de l’équation de Klein-Gordon. Dans le cas des ondes planes 
Jai été amené [16] ainsi à retrouver un type d'équations non linéaires déjà 
rencontré par R. FINKELSTEIN, R. LE Leyier, M. RuDERMAN [6], L. SOHIFF [20] 
et N. Rosen et H. B. RosENSTOCK [19]. 


2. — Les solutions principales de l’équation de Klein-Gordon. 


La mécanique ondulatoire usuelle représente les corpuscules sans spin par 
des fonctions d’ondes (x, y, 2, t) solutions de l’équation de Klein-Gordon 


yo jo = 0, 
(1) 1 © o? i | | 06 ne h 
apr naa gail ET mae 


Bien que des théorèmes généraux montrent l’équivalence de tous les systèmes 
complets de solutions des équations d’ondes, les applications de la mécanique 
ondulatoire montrent qu'il est nécessaire suivant les problèmes examinés, 
d'utiliser des systèmes de base adaptés possédant par exemple des symétries 
particulières. Dans une généralisation non linéaire de la mécanique ondula- 
toire qui semble souhaitable à différents égards il est possible que certains 
systèmes de base soient seuls à considérer, l’équivalence entre système pouvant: 
ne résulter que d’une dégénéréscence associée à l’approximation linéaire. 

Suivant les problèmes étudiés les principaux types de solutions de l'équation 
de Klein-Gordon sont: 


a) les solutions du type ondes planes, 


b) » » ondes invariantes, 
€) > » ondes sphériques, 
d) » » ondes guidées. 


a) Les solutions du type «ondes planes » s’obtiennent a partir de l’équa- 
tion (1) en supposant que les fonctions y(#, y, 2,t) ne dépendent que d’une 
seule variable soit 7, combinaison linéaire de #, y, 2, t 


[Wt— (px)] = Ket — (Kx), 


par l’intermédiaire de quatre constantes (W, P,, P,, P;) ou (K, K,, K., Ks) 
telles que 


We cp + mc cou K* Ki ue 
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La fonction y(t) solution de (1) est alors solution de l'équation différentielle 


d2 
(2) aM + y(t) = 0. 


La solution générale de (2) est une combinaison de deux types de solutions, 
les unes paires y,, les autres impaires y, 


(3) yp, =A cost, y = Ban. 


Les fonctions y(t) = p(Ket—(Kx)) peuvent être considérées comme résul- 
tant d’une transformation de Lorentz appliquée a la solution particuliére du 
système propre y(t), solution de (1) indépendante de #, y, 2. On a alors 


9 


27t 
T= Let = <> moc?t . 


Les ondes planes (3) soient y(t) forment un système complet de solutions 
de (1), fonctions de 7, uniformes, périodiques et d’amplitudes bornées. 

b) Les solutions invariantes s’obtiennent à partir de (1) en considérant 
les solutions particulières de cette équation qui ne dépendent que d’une seule 
variable celle-ci étant un invariant relativiste. 

On prend généralement pour cette variable 
(4) u= Ve? —r, 
ou 
u = ct? — (a? + y + 22). 


On voit facilement que 


(5) O = 


L’équation (1) détermine alors p(w) par 


d2 
du? 


L 3 d 
6 SES È 
(6) +5 Gy + HO] lu) = 0. 


C’est encore une équation différentielle dont la solution générale s'exprime 
au moyen de fonctions de Bessel d'ordre un 


A B C 
lt) = ya) + Maus) = CRE) + È HP un) 
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(J,, N; fonctions de Bessel de première et de seconde espèces, H” et H® fone- 
tions de Hankel d’ordre 1 correspondantes). 


c) et d) Pour l’introduction des ondes sphériques et des ondes guidées [18] 
nous allons maintenant supposer qu’il existe un repère privilégié R, dans lequel 
les fonctions d'ondes y(x, y, 2, t) s'expriment sous forme du produit d’une fonc- 
tion de t, y.(t) et d’une fonction des variables d'espace, »,(z, y, 2) ou y(r, 9, œ): 


(8) p(x, t) = pi(t)yp.(a, y, 2) = y(t)pi(r, 0, p). 
Nous aurons alors 


1 dy, 
Pa) di ane 


(1) — y(t) Ay,(x) + AA =. 
Introduisant deux constantes, Z,, A, telles que 
At a da = Hi ’ 


Y1(t) et w,(x) satisferont aux équations 


e dé (t) + Ax wa (8) =0, 


| Ay,(a, Y; 2) + Aspo(a, Yy, 2) =0. 


(Nous supposerons À, et 2, réels et nous nous bornerons ici au cas À, > 0 afin 
de ne pas introduire de solutions de type évanescent par rapport à t. Ces 
solutions amorties au cours du temps ne doivent pas étre écartées dans une 
étude générale que je ne fais pas ici). 

Nous obtenons alors pour la fonction y,(¢) 


(10) y(t) = € exp [t/t] + © exp [— iV Act] = ©, cos (VA;et) + €! sin (V7, ct) . 


Pour y.(x, y, 2) deux cas sont à considérer: 


tappa 0. 
Aw, + Aas = 0, 


admet alors pour solutions acceptables 
1 7 m 
(11) walt 0, 9) = ATV) + BM a (V Arig) ; 


Der, 410. 
Ay, — | 2 | Po = 0, 


35 - Supplemento al Nuovo Cimento. 
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a pour solutions restant bornées quand r — co 
A Vid ne 
(12) Ya(r, 0, P) RE VT Ki43( | Aa In)yi (0, p) ‘ 


Si nous nous bornons au cas J = 0 nous n’avons plus à considérer les fonc- 


m 


tions sphériques y7(0, g) et il reste 


(13) Pa(r, 0, p) = v2(7). 


On a donc dans les cas ci-dessus 


a Di gr sin wae + pr °° a 
A" e 
(15) pa(r) = — exp [— VIA |r] « 


Les solutions dites «ondes sphériques » de la mécanique ondulatoire ortho- 
doxe sont obtenues à partir de ces expressions en posant 


W? p? 
(16) ME ga Re EY, A, = = IK, 
d’où A, — A; = K? —| K/? = pw. 


On a alors nécessairement À, >; et pour 1 = 0 la solution onde sphérique 
générale s’écrit 
cos| K |r 


oY, | pete ah me 
T 


(17) an = yalt)yna(r) = [cf cos Ket + c! sin Ket} A’ |S! KI 
r 


ie sin Ae | K |r) Le cos (Ket | K|r) 
r 


La mécanique ondulatoire orthodoxe considère également le cas parti- 
culier des solutions ci-dessus pour lequel on a 


(18) h=0, y= Palr) . 


On est alors dans le cas 2) ci-dessus. 1, — 0 entraine |4,| — ui et 


(19) vy = pr) =.) = exp [— por] 


ees ns ma so sr Li 20 = cs î se n ee È TRAI JE. ¥ ‘ Pi af 
7 | 
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Cette solution, en fixant la valeur de la constante O,, est considérée comme 
représentant le champ y(r) créé par une source C, localisée au point r = 0 dans 
le système propre du corpuscule (ici le repère R,). 

On passe des solutions y —y.(t)p:(x, y, 2) avec wi(t), y,(x, y, +) donnés par 
(10), (11) et (12) ou (10), (14) et (15) aux solutions du type « Ondes guidées » = 
pour lesquelles le corpuscule est localisé et décrit une trajectoire (rectiligne et $ 
uniforme en l’absence de champ extérieur) en effectuant sur les fonctions y 
une transformation de Lorentz dépendant explicitement du temps. 

Pour un corpuscule se déplaçant le long de l’axe OZ avec la vitesse © nous — Gg 
poserons ue: 

ct = cosh yet'— sinh yz", 2 = cosh ye’— sinh yet’, È: 


If 


LEI, y =Y'; tgh y =v, 
d'où 7e 


ra= 2° + y'° + cosh? y(2— tgh yct'}, fe 

Vit = V(cosh yet'— sinh y2') . À 4 

Ecrivant encore | p 
VA, = Un K,= y, cosh y, K, = , sinh y, 1 

Vit = K,ct'—|K,|2", “à 


2 
ra = 024 y'24 (=) [et ot! ]? = pa 
Mi 


nous obtenons alors l’expression de 1’« onde guidée » 


(20) yp (x, ya,t)= 3 % 


a fai COS 
ia cosi 


(K, ct’— | K, |2") 


\ sin da 4 r 008 (Ve) :. 


J | fe a 
Le terme en A” introduit une singularité polaire qui se déplace avec la vitesse v 
(0'=0 pour æ'— y'=0, z'=vt). Même si l’on se borne à la partie régulière, 
une structure définie dans R, par une combinaison de solutions du type (10), i 
(11) et (12) ou (10), (14) et (15) engendre une solution combinaison des Di 
y' (0, y', 2',t') ci-dessus déplacée avec la vitesse v. x 
La solution particuliére 
Co EXP [— Mor] 

yy) = oP 
conduit a la solution guidée 
co EXP [— fo(x'? + y'? + cosh? y(e'— vt')?)?] 


[oy cosh ae = ot’)? |} i 


(21) ya’; Gee De 
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interprétée ordinairement comme champ de Yukawa d’une source en mou- 
vement rectiligne et uniforme de vitesse tgh y = v. 
De même les solutions du type onde plane rentrent dans le schéma des 


ondes guidées lorsque nous posons À — 0, À — 0, y = yi(t). 


3. — Les ondes planes généralisées déduites des fonctions elliptiques de Jacobi. 


Dans une extension de la mécanique ondulatoire basée sur l’équation de 
Klein-Gordon nous devrons généraliser soit l’ensemble des types de solutions 
que nous venons de considérer soit seulement certaines d’entre elles que des 
raisons physiques nous conduisent à considérer comme rattachées plus directe- 
ment à la représentation de la matière. 

Si nous considérons d’abord les solutions du type ondes planes, nous avons 
vu qu’elles pouvaient être considérées comme résultant d’une transformation 
du groupe de Lorentz appliquée aux solutions particulières du système propre 


(22) y, =A’ sin % , wy, = A” COS To; 
avec 

27 o 
(23) To = pei = 2% meet = Wel . 


Cette forme de solution met en évidence un caractére fondamental de la 
représentation des corpuscules en mécanique ondulatoire sur lequel M. L. 
DE BROGLIE a souvent insisté: Dans le système propre du corpuscule la fone- 
tion d’ondes associe à celui-ci une «horloge », c’est-à-dire une fonction pério- 
dique du temps propre de période T = h/m, c? (ou de fréquence », = moc?/h). 

Si nous voulons généraliser cette conception tout en essayant d’enrichir 
la notion de corpuscule en introduisant non plus la seule constante intrin- 
sèque v9 = m,c?/h mais deux ou plusieurs constantes, la généralisation la plus 
immédiate consiste à prendre comme fonction d’ondes représentant le corpu- 
scule dans son système propre, au lieu des fonctions circulaires cos t ou sin 7, 
certaines des fonctions elliptiques de Jacobi possédant une période réelle et 
une période imaginaire pure. La définition de ces fonctions introduit un para- 
mètre k réel, compris entre 0 et 1. Pour k = 0, ces fonctions redonnent sin 7 
et cost. La donnée de % équivaut à introduire un paramètre intrinsèque 
supplémentaire. 


La théorie des fonctions de Jacobi introduit trois fonctions principales: 
sn (u, À) de périodes 4K et 4iK', 
en (w, k) » 4K et 4iK', 
dn (u, k) » 2K et 2iK'. 
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Les périodes K(k) et K'(k) sont définies par l'intégrale 


nl2 
do 
K(k)= 
se) a 
0 


et par 
(he) == K(k’). avec k’* = 1 — k?. 


A partir de ces trois fonctions on construit un systéme de 12 fonctions 
elliptiques en adjoignant à sn, cn u, dnw leurs inverses et leurs quotients: 


1 1 
DU, neu = — , ndu=-—-, 
nu cn dn wu 
sn u sn % 
secu — tn u = ; sdu = -— , 
cn 4 dn wu 
cn # cn % 
CU —=—, ed u = —, 
sn dnu 
dnu dnu 
dsu = —, deu = —, 
w cn % 


un a notamment entre ces fonctions les relations 
sn (u + K, k) = cd (u, k) , 
en (u + K, k) = — k' sd (u, k), 
dn(u +K,k) — k'nd(u,k), 


sn (uv, 0) = sin, sn (w, 1) =tghu, 
1 
cn (w, 0) = cosu, lia 
d 0).= 1 a 5 È 
n(w, 0) =1, eine 


On trouvera dans de nombreux livres de mathématiques appliquées l’étude 
des propriétés de ces fonctions. A titre indicatif je ne citerai que les ouvrages 
d’APPEL et LACOUR [1], de GREENHILL [9], de TRICOMI [22] et l’excellente 
petite monographie de BOWMAN [2]. 

La généralisation des fonctions d’ondes solutions ondes planes de l’équation 
de Klein-Gordon nous amène à poser u=T d’où 


2 2 
(24) T = 4K(k)nt = 4K(k) oe = t= Wet. 


$ NT SION 
ra? 
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4K est ici l’analogue du facteur 27 du cas trigonométrique et ceci nous 
conduit à introduire une nouvelle constante de Planck réduite 


(25) i! = 


remplaçant la constante usuelle # = h/2x, mu, sera lié à la masse dynamique mp 
par l'intermédiaire de #', 


Mot 


(26) i 


Dans le système propre, nous avons la possibilité de définir des fonctions 
d’ondes doublement périodiques, paires et impaires, se réduisant respectivement 
à sin 7 et cost pour k= 0 suivant deux choix 


a) soit sn (t, k) et cd (rt, k), 

b) soit en (t, k) et sd (t, k). 
En outre, nous pouvons définir une fonction d’ondes doublement pério- 
dique se réduisant pour 4 — 0 à une constante en considérant les fonctions 


c) dn (7, k) et nd (zt, k). 


Nous allons maintenant examiner les équations différentielles du second 
ordre que le choix de ces fonctions nous conduit à adopter pour généraliser 
l'équation 

2 


aa v(t) + y(t) = 0. 


Pour cela nous examinerons les équations différentielles du second ordre 
dont les solutions sont les fonctions elliptiques de Jacobi. 
A) L’équation 
Ye + (1 — 2k*)y* + ktyt— kb? = 0, 
a pour solutions 
y=, cnu si y(0) ==. 1, 


y=k'sdu Si y(0).=0. 
Par suite 


y" + (1— 2k2)y + 2k2ys = 0, 
a pour solutions | | | 
y= cnu si y(0)=1, y'(0)=0, 
y =k'sdu Si y(0)}'= 0, #(0)= kK’. 


LE on di 
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B) y®+(.+ by? —kyt—1=0, 
a pour solutions 
y = snu Si y(0) 0, 
y = cdi si y(O}s= 1. 


Par suite 


y+ + ey — hy? = 0, 
a pour solutions 
y = sn u pour Y(0) = O0 y (ONU 
y= cd pour VE a0) 0: 
C) L’équation 
YEN + yt + bk? = 0, 
a pour solutions | 
| yee ‘inv pour y(0) = 1, 
y=k'ndu pour 4(Q) = ke. 
Par suite, 
y'— (1+ ky +29 = 0, 
a pour solutions 
y= dny pour YO) =, y (O)-== 0 ; 


 y=kndu pour y(0)=k, y(0)=0, 


Revenant des équations différentielles vérifiées par les fonctions y(t) aux 
équations aux dérivées partielles déterminant les fonctions y(a, y, 2, t) on voit 


immédiatement par correspondance que 


aes Te Tel 
oz) ; i 
y, = Ak’ sd [ (Ket — (Kx)), k], 


sont solutions particulières de 


Co . Oy + — 2h yup + ou SI 


(ue = K°—|K}). 


661° 
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| | y, = À sn [(Ket — (Kx)), k], 
29 
| vi = 2ed [(Kot — (Kx), E], 


sont solutions particuliéres de 


Qh? 
(30) Oy + (1+ Buy RE y = 0. 


[ vn= dn[(Ket—(Kx)), k], 
C) (31) 
[Vas Ak' nd [(Kct — (Kx)), k], 


sont solutions particulières de 


Ta 
(32) Cy— (+ Kad + SE yt = 0. 


Les équations (28), (30), (32) ont été déjà rencontrées par de nombreux 
auteurs notamment L. SCHIFF [20], N. Rosen et H. B. ROSENTOOK [19], 
R. FINKELSTEIN, R. LE LEVIER et M. RUDERMAN [6], B. J. MALENKA [13], 
D. IVANENKO [11]. 

Ces équations s’écrivent d’une façon générale 


(33) Op + ap + yys = 0, 


a et y désignant deux constantes. 


4. — Les solutions ondes planes des équations d’ondes non linéaires CJy + 
FH UNE À Alpen O 


Réciproquement nous allons utiliser les résultats ci-dessus pour caractériser 
les solutions ondes planes des équations (33) que nous répartirons en quatre types 


(A) Up +uiy + my = 0, 
(B) Op +uiy — my = 0, 
(C) Up — ip + my = 0, 
D) Op — py — my = 


(34) 


Plus précisément, nous allons déterminer lorsqu'elles existent sous des 


conditions que nous préciserons les solutions de ces équations du type ondes 
planes d’amplitudes bornées. 
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A) L’équation (A) admet pour solutions les ondes planes 


(35) y = 2cn[(Kct — (Kx)) + &, k], 
avec 
RA | K | = Lo ’ 
MC 
he 
en déterminant w, et k par 
2k? ys ; 
A2 pi, =, 


d’où 
[eo = pui 7 pad? ’ 
(36) | EIN 
2 (ui + pià?) 


Ici on a toujours 0<k?<3. L’onde plane n’est jamais apériodique. 
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La 


masse dynamique réduite 4, est toujours supérieure à w, tandis que la masse 


dynamique vraie m, à pour valeur 


huo 
= 4cK(k)° 


Si o>, est fixé, 2 et k sont déterminés par 


2 2 2 2 
AIR at i o — fh 
4 é Hi 26 


Si au lieu de w,, m est donné la détermination de k est plus complexe: 


devra dans ce cas résoudre l’équation transcendante 


hui 
16m2c? * 


(38) (1 —2k?)K2(k) = 


Si l’on se donne les trois constantes w,, us et k, (0 <k? <4) alors 


5 Qh ui 
mi(i — 2k?) ’ 
(39) 
2 2 
Se. Mi 2 ,2 es Mi 
Hee NRC Me 


on 
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Les ondes planes (35) sont solutions de (A) quelques soient les conditions 
initiales y(0) ou y'(0). Pour &—0, y=Acnt, pour & = K, y =Ak'sdt. 


B) Les équations de la forme 
(34B) Dy + my mp = 0, 


admettent pour solutions les ondes planes d’amplitudes bornées 


(40) y = Asn[(Ket—(Kx))+&,k], K?—|K?=yi, 
avec ici 

Ho = hi — fab. 
(41) 

pa Li 44 

…. 26 — pad? 


La condition 0<k?<1 entraîne 


0<A<-. 
Lez 
Ceci correspond à des restrictions sur les données initiales. 
En effet la solution ci-dessus n’existe que si les conditions initiales satisfont 
aux conditions 


et 
(y(0))? <a Qui — V2u5(p'(0))?] . 


Si ces conditions ne sont pas satisfaites, les solutions ondes planes de (B) 
sont des fonctions elliptiques de Jacobi devenant périodiquement non bornées 
et il ne semble pas que de telles fonctions soient susceptibles de représenter 
une structure corpusculaire physiquement réalisable. 

Réciproquement la donnée de v3, u?, u? détermine 22 et k? par 


4 _ 2(ui — pi) 


2 in 


Mo 


(42) 


Pie RE ee EUR x Ci 
> à - x a 
SUR UNE GÉNÉRALISATION NON LINÉAIRE DE LA MÉCANIQUE ONDULATOIRE ETC. 555 


Cette solution devient apériodique pour 2? = uî/u3. Alors ui = pî/2 et 


2 


(43) v.= tgh (Ket —(Kx)), y. 
Ma Ma 


La relation m c= hu,/4K montre alors que si uo = u1/V2 reste fini, K(1) + co, 
la masse propre dynamique m, tend vers zéro. 


C) Les équations du type (C) 
(340) Dy — wy + wy =0, 


admettent quelque soient les conditions initiales des solutions ondes planes 
soit du type Acnt soit du type Adnt. 


LR 
(44) y = Adn[(Ket — (Kx)) +&,%], 


satisfait aux équations (C), u$ et k? étant déterminés par 


292 
(45) Lense 


ke = 


sous la condition 


Pour é5= 0, y= dn; pour ¢,—K, = Mind, 
Pour |A|= nus, k —0, y se réduit à une constante: y = w/u. 
Pour k? =1 soit |Z|=y,V2/u,, y devient apériodique 


(an (u,1) = sata) 


cosh u 


mais alors u? = u?, #'>0. Il faut que la masse propre dynamique m, tende 
vers Zéro. 
Réciproquement si uw}, “i, 4, sont données 


ge — 2h ya 20H 
sd Hi 


sous la condition u?/2<yuÿ <yi . 
PU. ie Ss 
(46) y = Acn[ (Ket — (Kx)) + &,k], 
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avec 
i<k<1, 


est solution de (C). 
(Pour &=0, y,=Acnt, pour = -K, y, — + Ak' sd t). 
uè et k? sont alors déterminés par 


fo = pad? — pi; 
(47) pid? 


sous la condition 


Pour 2? = 2ulu;, k?—1, y devient apériodique 


V2 1 


la COShT 


(48) Po 


Réciproquement si ui, wi, 25 sont donnés 


2 2 2 2 
(49) FASI ne es (ui > ut) 
Ma “Mo 
D) Les équations du type 
(34D) Oy — wy — wp? = 0, 


n’admettent pas de solutions ondes planes d’amplitudes bornées. 
En effet les solutions de l’équation différentielle associée 


d?y(t) 
i — ip — jay? = 0, 


sont suivant les conditions initiales de l’une des formes 


Sn T 
Atnt, hae ANT. 


Ces fonctions doublement périodiques deviennent périodiquement non bornées 


et ne sont pas physiquement acceptables. Ceci conduit à écarter les équations 
du type (D). 
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Dans les trois cas (A), (B), (C) nous avons obtenu des solutions ondes 
planes du type stationnaire. 

La mécanique ondulatoire linéaire considère surtout les ondes planes du 
type « progressif » c’est à dire de la forme 


y = A exp [i(Kct — (Kx))|, 


correspondant aux solutions 


Ww = A exp [+ it | ; (t == Ket a= (Kx)), 
de 

d'y 

ER EN 


On peut se proposer de déterminer pour (34A), (34B), (34C) des ondes du 
même type se réduisant pour u, — 0 aux fonctions A exp [+ ir]. 
Si l’on considère l’équation 


Ys + ya) = 0, 
l'intégration directe donne 
y? + wy? = yo = const. 
%<0 conduit aux ondes stationnaires V7,/w? sin wx et Vy,/@? sin wx 


tandis que 7, — 0 conduit à y = A exp[+iwz]. 
Ici l’équation différentielle associée à l’équation (34A) par exemple soit 


pa + up + ip = 0, 


donne par intégration directe 
2 
(pi)? pe + E nti. 
Xo# 0 conduit aux solutions réelles stationnaires considérées précédemment. 


% = 0 conduit à un autre type de solutions. 
Posant y —1/y on voit facilement que si y, — 0 


il 
©, exp [i,t] — OC, exp [— ut]? 


(50) p(t) 


C, et C, désignant deux constantes liées par la relation ' 


2 


| vbs: 
(51) O01 = Rs 


LL vi | uf , Ul 9 CAT A i RFO ae 
li x 3 wf 
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Posant 1/C,=4,, 1/C,=— À, on écrit encore 
(52) y(t) = —— di si 
ver LAS exp [ut] — (u:/8ui)A exp [— tu,T] 
À 
— exp[— ur] —(u/8ui)As exp [ix 7] | 
ou encore 
i Ay 
(53) y(t) 


exp [i4,7](1 + it 1/8441) — (u3/4pî)Aî COS LT A 
hy 
[+ (u3/8ui) 45] exp [— ut] — (ui4ui)} COS pt 


Ces fonctions sont simplement périodiques. On voit facilement sur ces 
expressions comment s’opére lorsque u, —0 le passage aux solutions du cas 
de Klein-Gordon. 

L’onde plane y(Ket— (Kx)), (K?—|K}?— y) n’est jamais purement pro- 
gressive. A côté du terme progressif figure un terme stationnaire. Ceci peut 
encore s’interpréter en disant que les ondes planes de ce type ne sont jamais 
uniquement à énergie positive ou uniquement à énergie négative. Un terme 
de battement accompagne toujours le terme principal progressif à énergie 
positive ou négative. 


5. — La composition des fonctions d’ondes dans les théories non linéaires. 


Les équations (34A), (34B), (34C) ne sont pas linéaires et la somme de 
deux solutions n’est pas une solution. Néanmoins et c’est 14 un point sur 
lequel je veux insister maintenant, il existe pour les ondes planes solutions 
de ces équations un théorème d’addition ou si l’on préfère un théorème de 
composition. 

Celui-ci va résulter immédiatement des théorèmes d’addition des fonctions 
elliptiques. 

Considérons la fonction enw. On a vu que 


enlu+K)=—k'sdu, sd(u + K) =— enw. 


On peut montrer que les théorémes d’addition des fonctions elliptiques 
tels qu’ils sont donnés dans les traités classiques prennent également la forme 
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suivante 


cnu env + k? sdu sd 


cn (WE) 

(uv) 1+ kenu sdo env sd” 

sd (u + v) = ee tr. 4 
1+k2sdu cnu sdv env 

(54) 

snu cedo + cdu snv 

DISTA cd can de 
“dd 1 

te ane edu edv + snu snv 


1+ k?snuecdv sn edo’ 


Si l’on considère pour un corpuscule représenté par l’équation (384A) des. 
états 7, et 7, auxquels correspondent les fonctions d’ondes 


| peo = Ant, , Ak'8d 7; ; 
(55) 


;(2) 


VA Lt) po = Àk'SAT , 
à la fonction d’état (1) +(2) ou y(r, +7.) correspondent les fonctions 


ME =A Cn(T+T%, Kk), 


| 
(56) 
| yore = Ak'sd (t, +, k). 


Le théorème d’addition donne alors 


)+(2) Alpe pe’ — pe ye] 
SEE (Ik) pe ye eee 
(57) 
yor — Apr ye? + pepe | 


a — JR" 2) yoy? yO p® 3 


La possibilité de construire des fonctions d’état à deux corpuscules à partir 
des fonctions d’états à un corpuscule rend possible la construction d’un espace 
d'états nécessaire pour introduire une seconde quantification. 

La seconde quantification a été généralement considérée comme néces- 
sitant une théorie linéaire. Il me semble que ceci n’est pas nécessaire mais. 
que la seconde quantification est essentiellement attachée à la possibilité de. 
construire des états à 2,3,... n particules à partir des états à une particule. 
Pour cela, il suffit que dans la théorie considérée, il existe un théorème d’ad- 
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dition ou de composition des états, c’est à dire qu’à partir des fonctions repré- 
sentant un état à n particules et un état à une particule on puisse construire 
un état à n +1 particules. 

Les fonctions d’ondes acceptables seront donc celles admettant un théo- 
rème d’addition. Cette condition, nécessaire mais non suffisante, semble a priori 
très large. Néanmoins nous allons voir que l’on peut apporter à la détermi- 
nation de ces fonctions une solution particulière remarquable. 

En effet, WEIERSTRASS à démontré un théorème remarquable (voir par 
exemple le traité des fonctions elliptiques de HANCKOCK [10]) qui répond a 
notre question. 

WEIERSTRASS appelle théorème d’addition algébrique une relation algé- 
brique liant les fonctions Du), D(v), D(u +), et voici son théorème: 

« Toute fonction pour laquelle il existe un théorème d’addition algébrique 
est une fonction elliptique ou l’une de ses dégénérescences ». 

L'application de ce théorème aux solutions ondes planes nous conduit d’une 
façon limitative aux équations d’ondes considérées ci-dessus. 

Toutefois la nature algébrique d’un théorème d’addition des fonctions 
d’ondes ne s'impose pas du point de vue de l'interprétation physique et rien 
ne nous conduit à penser que la nature obéisse à des règles traduites par des 
lois algébriques. 

Je vais d’ailleurs considérer maintenant un exemple simple d’équations 
d’ondes généralisant les équations précédentes et pour lequel il existera un 
théorème d’addition non algébrique pour les ondes planes. 

Pour cela je considère les équations d’ondes non linéaires 

(x) Clyp+usiny =0, 
(58) 
| (6) Cly + ui sinh y — 0. 


Si l’on considère que ces équations sont «approchées » par les équations 
obtenues en remplaçant sin y et sinh y par les premiers termes de leurs déve- 
loppements en séries ces équations sont les généralisations de 


ui 
(x') Oy + wip — y = 
(59) 


2 
(8 Op +pmip + ey agi: 
Si l’on pose alors y = Ag, w74°/6= uî, on obtient pour g les équations 


| (2) Op + nip — ig? =0, 


|) Op + wip + wp? = 0. 


(60) 


a 
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On retrouve les équations des types (34A) et (34B) précédents. 
Les solutions du type «ondes planes » des équations (58x) et (588) peuvent 


s’obtenir sans difficulté. 
Si l’on pose 


t = Ket — (Kx), avec K?—|K|? = y}, 
les solutions «ondes planes » de (58x) et (586) seront de la forme 
y(x, 1) = y(T) , 


w(t) étant solution des équations différentielles 


d2y(t) ui 
he Po p(t TI 0, 
(61) 
Eve +5 i sin (t) = 0 
dt? SUE i 
ou 
[ @) y+ asin yr) =0, 
(62) 


| vit ja sinh (er) = 0. 


Nous allons examiner les solutions de (62x), celles de (628) s’obtenant par 
une analyse parallèle. 

L’équation (62x) est bien connue en physique: c’est l'équation du mou- 
vement pendulaire: 

Alors que l’équation de Klein-Gordon associait au corpuscule dans son 
système propre le mouvement d’un oscillateur sinusoidal, les équations non 
linéaires considérées ici lui associent un mouvement pendulaire. 

Les solutions de 


(580) Oy + ui sin y = 0, 
ne sont définies qu’à un multiple de 27 près. Si y(t) est solution il en sera 
de méme de 


wi(T) = Yo(t) + 2nx. 


De même si l’on pose 


36 - Supplemento al Nuovo Cimento. 
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les fonctions y,(t) satisfont a 
(58y) Cly.(t) + ui cos p.(t) = 0. 
Les solutions de (58x) permettent done d’écrire immédiatement celles de (58) 
et de (58y) 
Pour obtenir les solutions ondes planes de (58) il nous suffit de considérer 
l'équation différentielle (62x) qui par intégration directe donne 
(63) (41)? — 241 cos py = Yo) 


% étant une constante telle que 


12 € 2 
Lo = Yo — 2%1 COS Yo - 


Nous en déduisons 


Ro 3 PAT O 
(64) (WP == (PAT) IE HAT sin? 9? 


et ceci nous conduit à considérer deux cas 


he it 2 
Lans sol Xo>2%1 - 


Posant k? = 4y;/(Y%0+ 2%) 


, 4 : 
(65) (yi)? = Bh» sin? |, 
4% 
2) +2 1 1 } 

d’où 

67 I 4% à eu 
(67) (y)? = uri 1—k? sin? À 

1 


Dans le premier cas, on a immédiatement 


yl2 
d 2V% 

DI Se 
; V1 — k? sin? 0 È k 


ou 
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F(p, k) désignant l’intégrale elliptique de Legendre 


P 
do 
E Ai Vink sind 


Introduisant la fonction am (wu, k) = @ telle que 


sing =sn(u,k) et cosy — cn (u, k), 


on obtient 
(68) . = am {+ vos + &,; t), 
sin + — sn (E TE e) ; 
(69) 
cost = on V2 LÉ, x) 


ce qui exige 


Vi RAT 
se déduit de la solution du premier cas par la relation 


F(y, ky) = kKF (1, k), 
avec 


g,= arcsin (k, sin), 
ou par la formule dite du module réciproque 


sn (ku, k,) = ksn (u, k), 
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qui nous donne ici 


(70) sin? = ksn(Vyt +84). 

Nous avons done dans le cas considéré des expressions simples au moyen 
des fonctions elliptiques des ondes planes solutions des équations d’ondes (58). 

Il existe encore ici un théorème d’addition pour les fonctions d’ondes solu- 
tions du type ondes planes. 

En effet, soit 


T = K,ct — (K;x), Ta = K,ct — (K,x), 
avec 


Ki — (K,)?= Ki — (K,)?= ui ’ 


w(t,) et w(7,) désignant les solutions précédentes, y(t, +7.) s'exprime au moyen 
de p(t,) et de y(t). 
En effet nous avons 


y(t) = 2 am | T + 3 1 
Le théorème d’addition des fonctions am nous donne 


(71) am (u, + «,) = arctg (tnu, dn u,) + aretg (tnu, dn u,) = 


sin Peel sin — —— 
= arcte ui V1 — k? sin? + arctg a VIE k? sin? 
8 Perl 8 Pils 
COS Hy COS ©; 


2 


(sing, = SN 4, COS gi = CN , sing, = SN Us » COS g, = CN Us). 


On en déduit immédiatement le théorème d’addition correspondant pour 
les fonctions y(r;), y(t.). Il n’est pas nécessaire de souligner le caractère non 
algébrique de ce théorème d’addition. 

Il peut être intéressant de rattacher les ondes planes solutions des équa- 
tions (34A), (34B), (34C) aux développements de la théorie quantique des 
champs. 

Ceci revient à exprimer les ondes planes du cas (34A) par exemple, de la 
forme 


(72) y(t) = Aent = À en [uoct, k], 
dans le système propre, au moyen des fonctions 


(73) Acost’=Acosuct, ou  Asint'— À sin wct. 
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La théorie des fonctions elliptiques nous fournit immédiatement deux 
développements de ce type. 


a) Le développement en série de Fourier des fonctions elliptiques nous 
donne pour en“ 


È Dies ght nu 
(74) cn = 2e APTE COS en +1) a 
avec 

ex 1G 
q = exp tr 
Nous en déduisons 

F7 27 grt n 
(75) wir) == A cn\urt, k) = A FK, i+ gra cos (act) , 
avec 

I IT 

(76) pa = (2m +1) = to. 


L’onde y(t) peut être considérée comme résultant d’une série particulière 
d’ondes planes solutions d’équations de Klein-Gordon avec une suite de masses 
propres réduites w, multiples impairs de la masse propre réduite 


i IU 
(77) Ma = 5 He Po : 


b) Le développement en produit infini des fonctions elliptiques donne 
pour en u: 


mb 12 | 1+ 2g?” cos (xu/K) + q*” | 


nu = 2g*k'?k-* cos —- i 
(78) ecnu= 2q COR II 1 — 2q%-1 cos (mu/K) + qs? 


Ceci nous donne 


he ee poe f 1 + 292" cos pct + gt" 
7 : a — 92911:!% 4 È 
(79) w(7) =Acn[pmoct, k] = 2qtk'tk-* cos poet II F Dg?" cos wet + >|" 


avec ici 
7 TT 
(80) Ho = FF bo) 
d’où 
Ho > Mo ; pour 7/2 < K(k)<x, 


HU: pour K(k)> x: 


vi 
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L’onde plane y(t) s'exprime donc au moyen d’un produit infini de com- 
binaisons d’ondes planes solutions d’une équation de Klein-Gordon pour un 
corpuscule de masse propre réduite y, = (x/K)o- 


6. — Solutions invariantes et solutions radiales des équations précédentes. 


Nous complèterons cette étude en examinant brièvement pour les équa- 
tions du type 


(34) Oy + pip +: wy? = 0, 


les solutions du type «ondes invariantes » y(u) avec u?= ct? — (w+ y?+- 22), et 
les solutions du type y = y(r) avec 7? = 2? + y?+ 2. 
Ces solutions particulières sont déterminées par les équations différentielles 


di 3 d : 

(81) Agia dae ui] plu) + way? = 0, 
d? 2d A 

» ars + Fay VE 


Les équations de ce type ont fait l’objet de nombreuses études mathéma- 
tiques notamment de R. O. FORNAGUERA [7], de M. Cimino [3] et de JAICH- 
NICYM [12]. Leur intégration ne semble pas rattachable à des transcendantes 
caractérisées jusqu'ici. 

Nous n’indiquerons que quelques résultats relatifs au cas wu — 0. 

Les équations (81) et (82) se réduisent alors à 


dist MEO 
(33) law + aa y(u) + uay° — 0, 

ar da co 
(84) FEE w(r) F way? =0. 


L’équation (83) admet la solution particulière remarquable 


V+1 
y(u) (AU fete 


d’où l’on déduit l’onde invariante singulière sur le cone de lumière 


VEIL 


(35) y(e, Y, è, t) = 2 
pe — (a + y? + 22) 
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L’équation (84) sous la forme 


d? À 2. d Lat n 
ela TO, 
se ramène à l’équation d’Emden [4] étudiée notamment par E. A. MILNE [14], 
N. FAIRCLOUGH [5], R. H. FOWLER [8]. 
En effet, si l’on pose 


g(r) est déterminée par l’équation 


2 
” = ! 
Pe err ere On, 


qui est la forme canonique de l’équation d’Emden adoptée par E. A. MILNE [14]. 
Si g(r) est une solution, on voit immédiatement que 


Ag Ar) , 


est également une solution. 
E. A. MILNE a étudié les différentes formes de solutions g(r) telles que 


dg di 
Pn) =0, (To =a aan OF? 


pour 7, —1 suivant les différentes valeurs de C. Il a montré notamment qu’il 
existe une seule intégrale positive telle que g(1) = 0 et qui pour r —0 prenne 
une valeur w(0) restant finie. Réciproquement (solution d’Emden) si on con- 
sidère une solution g(r) qui pour r —0 prend une valeur finie (pour laquelle on 
peut poser g(0) —1 avec une valeur convenable de A) et telle que (dp/dr),_,,=0, 
on trouve la fonction tabulée par N. FAIRCLOUGH [5] qui s’annule pour 
r — Y — 6.9011 et en ce point p'(n)=—0.40231 et rîp'(n) = — 2.015 0. 
L'intégrale générale de 


I 2 
Pr +7 Pr ror = 0, 


dépend ici des deux constantes À et C. Pour toute valeur finie de 4, il existe 
pour g(0) donné une valeur de C=C) pour laquelle il existe une solution. 
Cette solution s’annule pour r = r, et la tangente a cette solution pour r= 7, 
définit C=- O,. Pour les autres valeurs de 0+ C, il existe des solutions g(r, C) 
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telles que g(r0, 0) = pro, Co) mais divergentes pour r—0, les unes tendant 
vers + co, les autres vers — co. E. A. MILNE a montré l’allure générale de 
ces fonctions sur un diagramme. Toutefois je ne crois pas que l’analyse de 
Milne ait été étendue au domaine r > 7, sauf dans l’étude générale de R. O. For- 
NAGUERA et dans une note de JAIëNICYM dont les résultats ne semblent pas 
se raccorder avec ceux de Milne. 
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Contents. — 1. Introduction. — 2. Unrenormalized theory. — 3. Re- 
normalization. 


1. — Introduction. 


I shall begin with a brief résumé of some results on unrenormalized field 
theories which are relevant for the purpose of this talk; I shall offer no proofs, 
since they have mostly appeared in print starting from 1952 (1), and only try 
to establish the notation and make things as plain as possible. All this will 
be about a) perturbative expansions of propagation kernels, and b) equations 
which connect the kernels themselves. This done, one is left with a formal 
theory which is in good shape; in cases where there is no other trouble due 
to divergent integrals (approximate field theoretic models, or conceivable ap- 
plications to many-body problems arising from statistical mechanics or some 
other parts of physics) it may even be regarded as final, because one can often 
shown, then, that the perturbative expansions are convergent. 

In all cases of major theoretical interest, though, this is but one step to 
the end; just about everything turns out to be infinite, so that renormalization, | 


(*) Text of a talk delivered at the Collôque sur les Problèmes Mathématiques de 
la Théorie Quantique des Champs. Lille, June 3-8, 1957. 

(1) E. R. CArANIELLO and S. FuBINI: Nuovo Cimento, 9, 1218 (1952); E. R. Cara- 
NIELLO: Rel. Sem. Ist. Fis. Roma (July 1953); Nuovo Cimento, 10, 1634 (1953); 11, 
492 (1954); 12, 561 (1954); 2, 186 (1955); 5, 739 (1957). 
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or the art of removing consistently infinities into the unknown parameters of 
the theory, masses and charges, becomes the next major problem. The third, 
(but not last) question is whether in a renormalized theory solutions exist or 
not. Methods for carrying renormalization through flood the literature and 
are certainly more than I can claim to know; apart from this, regularization, 
or use of some device to make something finite which is not, has been much 
applied. Also on the question of existence much work has been done; so many 
people have felt the need to disprove existence that this alone shows that no 
one feels entirely happy with his friends’ proofs: let us just say that no proof 
of non-existence has been proved to exist as yet. 

My personal attitude is very modest: I do not know anything about exis- 
tence, and offer no guesses or «intuitive » arguments, because I strongly fear 
that wishful thinking does not suffice to turn foul mathematics into clean 
physics. Also in dealing with renormalization, I am far from claiming to have 
solved a problem which has been solved already so many times. I rather take 
Hadamard’s viewpoint, when he states that what matters more sometimes is 
not whether a problem is solved, but how much solved it is; and shall propose 
a purely mathematical version of the renormalization program (which includes, 
here, also regularization): one, that is, that a mathematician entirely ignorant 
of physics would naturally find, which nowhere makes use of infinite quantities, 
avoids all graphology and, above all, is simple to formulate and conducive 
to exact mathematical statements (majoration of integrals, etc.). 

I shall show, that is, that if the concept of integral is suitably redefined 
(think, for instance, of Hadamard’s partie finie), then it follows automatically 
that the formal, unrenormalized theory becomes ipso facto the renormalized 
theory. Any such prescription—infinite choices are possible—can be proved 
to be equivalent, to within additional finite renormalizations, to the formal 
use of counter terms. But this becomes now only a thing to prove once for 
all and then to forget about—we only need to know that such a proof exists, 
to be able to work happily with our re-defined integrals—keeping, of course, 
masses and charges as indeterminate parameters to be determined from ex- 
periment. Renormalization—I mean, of course, infinite renormalization—dis- 
appears, simply, from the stage. 

I shall proceed by starting from the perturbative expansions, because the 
identification of the terms that the re-defined integrals eliminate with cor- 
rections to masses and charges is best seen this way; it is a trivial matter to 
verify, conversely, that one can work as well directly with the equations among 
kernels, or whatever exact or approximate equations are deduced from them— 
the method is, in fact, completely independent of any perturbative treatment. 

Here too I shall be obliged to report only the salient points, because some 
hellish combinatorics is required, and you would never forgive me if I were 
to put all of it on the blackboard. 
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2. — Unrenormalized theory. 


21. — Suppose a Fredholm equation were given to a mathematician; sup- 
pose this mathematician knew rules to evaluate all the terms which belong 
to the 4” term of the Fredholm expansion of the resolvent, but was entirely 
ignorant of determinants and of how many terms are contained in the expansion 
of a determinant, and each term he would picture with a graph, say a term 


prenantes with PR nes 
fro =/dé,f(é,) . 


This is exactly the situation in the theory of perturbative expansions as 
it was when I learned it, in 1952. Any such graph is a Feynman graph, and 
the case just mentioned is the very physical one of an electron in an external 
E.M. field; the kernel k(xy) expresses the propagation of the electron from 
point x to point y if no field is present: we call it the « free propagator ». (The 
Fredholm eq. is, as everybody knows, the theory of one field) (2). 

If two types of particles move about, one of which can create or absorb 
the other—say electrons and photons—then a Feynman graph will look like 
this: 


and be described by 
J heorya(12)H25)e(Gy)Rlo's) (48) kB’ GORI RTS (14M (O5ACBONNT8) E 


where h(xy) is the photon free propagator, k(xy) the electron free propagator. 
Several papers have been published just to attempt approximate estimates 
of the number of such graphs pertaining to a given 4”. 
Again here we may take Hadamard’s point of view: Feynman graphs solve 
the problem, but do not solve it « very much ». In the Fredholm theory, we 
could not go very far without knowing about determinants. 


(2) The best work on this subject was produced by P. T. MatTHEws and A. SALAM: 
Phys. Rev., 90, 690 (1953). 
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It is time to state nomenclature and notation. I shall call «kernel» K 
what, in the Fredholm theory, is called « resolvent » or « resolving kernel », or 
« Fredholm minor », according to the number of variables in it. What mathe- 
maticians call kernels, I shall call «free propagators » and shall write (xy) for 
the electron free propagator, [vy] for the photon free propagator. 


(zy) stands for 383. (2 y) = (24)}.a, (with spinor indices), 
and 


[vy] stands for UP ls (x, y) =[ry],,,, (With vector indices). 


There will be constant numerical matrices, which saturate indices of (xy) 
and [wy]: these I shall denote with y*=y{*,. Thus: 


| | (w1)y1(12)p4(2y)[12) , 


means actually 


> fe fer NEPCT CP È 


pia, 


The equations of field theory are singular, so that such expansions may 
very well be meaningless; but, as in the Fredholm theory, one may gain useful 
information by first studying the corresponding regular case, that is the formal 
unrenormalized theory, which we may suppose to have made regular with 
use of cut-offs and finite instead of infinite domains of integration. 


2°2.— The Fredholm theory is only a very particular case among field theories ; 
in general, determinants do not suffice. Given any field theory, I proved in 
1953 (*) that a perturbative expansion which generalizes that of the Fredholm 
resolvent can always be written if two algorithms are introduced: pfaffians 
and hafnians. 


Definitions and notations. — Let w"= œx' denote a linear form (summa- 
tion convention) in @!, x?,..., #",... which are elements of an algebra. We 
define a product (outer, or Grassmann product): 


(1) ah A dt = — wk A woh (Ad = 0) 


Then, as is well known: 


12/45 


1 di i 
(2) WNW... \wt = al; ; 5) iar … Aa, (kK<n; =(), if k>n), 
. di; 2 eee 
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12 

where ( ( . lis the Sylvester notation for a determinant with row indices 
LA ae 

1, 2,..., k, column indices 7,, è,,..., î, and elements at. 


If we define another type of product (Clifford product): 
(forgive my notation!) 


(3) x \ ck = — vA + 20px (a Av" = 1) 
we find 

[el 3 
(4) AO GO? RO NOE = DATI. Nar) A (err, 


r=) cr 


molere: hy SRE ER, Lh... <b, ai a permutation of 1, 2;..., koi 
parity p; > is the sum over all such permutations for a given r; [k/2] is the 


Cr 


maximum integer contained in k/2; (A, ... h,) is the pfaffian, as defined below 
lie 2: 
ww? = (12) + w1/\w, (12) elatone 


Once the elements (hk) are defined (mark that, in general, (hk)  (kh), 
as against the example just reported), then the pfaffian is defined by recursion: 


(5) OSIMO NEAR or), (Or 1th teumley 


h=2 
or 
= D (= 1)? (4,%,) tee (doni; dor) ’ 


where >’ denotes sum over all permutations 7,1, ... à, of 1, 2, ..., 2r (of parity p) 
such that: 


Dott Sen AM Oy ge aa es 


A convenient symbol is also 


(12) oe Le (1, 2r) 

QL 227) = 
(23 a aes. (2, 2r) 
(2r—1, 2r) 
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2m 
om 


dl 


| (hk), h<k, 
| is the determinant of elements ay =\— (kh), h>k, 

|. 0 eee 
In a pfaffian, there are lines (line h: all elements with a label h), but nor rows 
and columns. 

This is all I shall say about pfaffians and determinants for the time being. 
The actual story is much more involved with combinatorics, but I think this 
may suffice to show why anticommuting fields give rise, in the kernel expansions 
only to pfaffians and determinants (the latter are a particular case of the former: 
if suitable elements of a pfaffian are made to vanish, it reduces to a determinant 
of half its order). 

Typical of the above is the expansion rule of pfaffians and determinants 
which prescribes alternating signs—a consequence of the anticommutation 
relations, or Fermi statistics, of fermion fields. 

When studying the other accepted case, of commuting or boson fields, one 
finds that exactly the same algorithms appear, except that all terms in the 
expansions have always a positive sign. To avoid confusion, I use systematically 
round brackets for things connected with anticommutation and square brackets 
for things connected with commutation. In the study of boson fields, one 
finds thus: 


1 
he 
where (i 


bo N 


Lien 
permanents | 1 "4 of elements [hk] (counterpart of determinants) 
and 
hafnians [1 ... 2m] of elements [hk] (counterpart of pfaffians). 


(The last name is my own contribution, for which I offer no apology to you, 
but rather a nostalgic thought to Copenhagen). 


2°3. Kernel expansions. — I shall mention only electrodynamics (with a 
non-vanishing photon mass), but everything I say applies, mutatis mutandis, 
to all cases. The kernels I speak of are the vacuum expectation values of time 
ordered products of field operators you have heard so much about; if they 
are known, all is known. (They bear a variety of names in the literature, but 
I am quite contented with calling them kernels.) 

If #=€=1, and 2 is the interaction strength, the fundamental expansion 
formula in electrodynamics turns out to be: 


di see tn) = 


= Cal 1 x (li Wa Sy a 
2a ves & oes Yn, te à lee tr, E... Ex]. 


Ly... Ly, 


DA Ne eee K | 
Yi eee Yn, 
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In it: N, in-going and N, out-going electron lines, P, photon lines — N(P,) 

means that N > 0 has the same parity as P,. The determinant is the decay 

product of a primeval pfaffian (in a Majorana theory, the pfaffian would sur- 

vive), and contains all that pertains to electrons; the hafnian carries all the 
photon contribution. 

If the H.M.field is not quantized, then the hafnian is not there and > 

N 


ranges over all N >0: we recognize the Fredholm expansion. If the particle: 
field is not quantized, then only the hafnians are left in, and we can play 
around and sum the series exactly, obtaining known results of Feynman and 
others. 

We shall see that it suffices to consider the purely fermionic kernels (P,— 0). 
Then the series can be written (if it is allowed to exchange integrations) 


(7) o K(* ae vi n 


foe} AA7 1 1 eee Wy 1". San 
(5) | Sere nou Jess tasca 


n! Wis Wi ECS ae Can 


The vacuum expectation values are actually the ratios K,,/K,,, just as is the 
case with the Fredholm resolvent. I have also formulae (*) that give, as. 
simply, the expansions of these ratios, or of such things as the lg K,,: 


É162 
bE 


Soba 
Esta 


cee 


Co) }2% 
lg Koo Ela » apt tad pere) oe ERBA 


Sani #3 
Enn-1 Sen) ? 


where the bars in the determinant symbol indicate a simple rule to suppress: 
some terms of the expansion. This last formula, if compared with the familiar 
one for the lg of the Fredholm determinant, exhibits clearly all the similarities 
and all the differences between a two-field and a one-field theory. 


Convergence. — It was generally believed that an expansion such as (6) 
or (7) (at the time in which it was a major job to write or count each term 
of the coefficient of A”) should be actually divergent: there are indeed too 
many terms in this coefficient, and the relative signs were not known. 

By looking at (7), however, it is easy to see (4) that, if we take a non- 


singular theory (4-integr. vol. © finite; |(2y),_g, |< M, [09], I<D: 
Es) 2 n A6n 

(8) | Ky, | = > (5) at Q2Lr[ M?" (N, a Qn) Nol +n] 5 
n=0 . 


(3) E. R. CAIANIELLO: Nuovo Cimento, 2, 186 (1955). 
(4) E. R. CAIANIELLO: Nuovo Cimento, 3, 223 (1956). 


SS. a 
x 
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which is immediately seen to have a finite radius of convergence, independent 
of N,. This result is a disproof of previous too pessimistic views rather than 
a proof—but may have full value in cases other than relativistic field theories. 

If we specialize our restriction further, and assume that the boundedness 
of (xy) is a consequence of the fermion field having only a finite number of 
free modes, then it can be proved that the radius of convergence of (6) or (7) 
is infinite (5): K,,, is an entire function, and everything works as in the 
Fredholm theory. The proof would take too much of your time, and I have 
to ask you to believe YENNIE and me. 

I have not pushed this study further, for lack of time: there is a variety 
of problems that could be studied and I should not be surprised if some inte- 
resting results could be gathered with little effort, because the actual theory 
is after all a limiting case of such «regular », unphysical approximations. It 
will also be interesting to study the effects of renormalization—with some 
method suggested by the results I shall discuss later—on the convergence 
of these expansions. 


204. Expansions among kernels. — The kernels satisfy equations of various 
sorts, which I call branching equations because of the graphical pictures that 
can be associated to them. Since we have the perturbative expansions at 
hand, it is most simple to derive the equations from these; but it can be also 
immediately proved, and this has been done by many people (e.g. CAND- 
LIN (°)), including myself (1), that all such equations hold true even if per- 
turbative expansions are meaningless. These equations can be taken as the 
definition of the theory; they depend only on the commutation properties of 
the fields, and are therefore valid also for non-relativistic theories; if one starts 
from them, no mention need be made of the vacuum state, or of the fact that 
free propagators are solutions of differential equations. Such a theory may 
well turn out, for instance, to be different from standard field theory. It is 
important for the mathematician, I believe, if not for the physicist, that they 
are integral equations. The corresponding differential equations could be 
immediately written, and are, of course, of hyperbolic type. 

I shall list some of them: 


(9) x (% as Oy, 
Yr ++ Yn, 


Pa ; 
t, … tn) = » ILA (7 «Ly, 
1 


hes eee No 


to eos RESTO tha eee 2) + 


+4 J déy*[t,61E (e n > i 


(5) This very interesting result was kindly communicated to me by D. R. YENNIE; 
an alternative proof has been given by A. Buccarurri and myself, in Naples. Both 
papers are in print. 

(9) D. J. CANDLIN: Nuovo Cimento, 12, 380 (1954). 


Lab tn) A 


ae 
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which proves that it suffices to know only Bx 


ee 
> LES Lx M do lese 5 d'y 
(10) K | à SIE te) = ¥ (—1)"(a,1 K | 3 ge NS BP 
Yi ose Yx, x È SI ) | Yr) Yi oss Yn—1» Yn+x ses Yx, ; de as 


2 | déy4(a,€)K E TANI | Dia 2 


Yi ++» Yn, 
(+ difference from Fred. eq.), 
(and the associate equation) 


[by ase La = La see By 
(19: E ( Kile Die 1rd" | — 3° | dé, | dé, > payé. 


Yi Yx,/ n=1 Y FY) 


2 (ENE ELLE ( Cest Un, dI) , 
Yi. Ya, Es 


ITA tn) 


0 oo v Ly È 
(12) =, K i ml te) =fa dae di 
oA Yi ++» Uy, i 2 J 5 2 4 Yi Yn, È 


0 they nae lb 
(13) 4a ( Ur, ti sete) =—i aes 


Yi Yn, 


È 
È 
lo) K Ty eee Ly, DA BRE By, | 
(mi) x liu i) ato déK ee SE e) ; 


Yi. Yn, Yi ++ Yx, 


(14) 


(13) and (14) ((££) = [&£]= 0, by def.) derive from: 


(131) (v2. + mey) = ide — y) + LM) = i facere), 
F 
and 
i ola da 
(14) (O, mi) [ay] = à d(@ — y) » = il dé[2eéy], 


and are the counterparts of Ward’s identifies. 

These equations can be exploited to get other equations which contain 
‘only one kernel (but then, an infinite number of conditions is required to get 
a unique solution): some of these contain 0/07 and 02/07? only, besides in- 
tegral operators. Very interesting consequences can be deduced from them, 
but since work on these lines has just begun, I prefer to make no statements 
on this subject. It is interesting to observe that such equations, considered 
as differential equations in À are singular for À — 0 and A= oo. This is, in- 
deed, one of their major attractions. 

It may well be that the actual renormalized theory does not admit per- 
turbative expansion; it can be easily seen, though (7), that it follows from 


(7) E. R. CAIANIELLO: Nuovo Cimento, 1, 337 (1955). 


37 - Supplemento al Nuovo Cimento. 


i: 


DRE ee PNR PO A 


= 


5 +. Sty 
Ra dare Meo 


ee 
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the branching equations that if the renormalized equations admit solutions, 
and if these solutions are continuous in the point A= 0, then also all A-deri- - 
vatives of the kernels exist and are continuous for A= 0. This is, I think, a 


non-trivial result. 


3. — Renormalization. 


3.1. — The first part of this program, as outlined above, was successful 
mainly because the problems were studied in their most general form, and 
because use was made of configuration rather than momentum space. This 
I have found to be even truer in my study of renormalization: as long as I 
dealt with momentum-space integrals, I could only stare at expressions which 
were infinite, and that was all. In configuration space, instead, it is quite 
easy to analyze the reasons why the integrals diverge. Two are the obvious 
causes of infinities: the fact that the domain of integration is infinite, and 
the fact that the singularities of the integrand, when two or more points co- 
incide (or are on the light cone of each other) become of too high orders; the 
latter is the essential point to investigate, which remains completely hidden 
in momentum space. 

First of all, let us get rid of the first trouble. It can appear either as an 
infrared divergence: we rule this out by giving the photon a finite mass, and 
that’s that; or it can show up in terms connected with vacuum fluctuations, 
which it is convenient in this formalism to leave in to the very end, dividing 
then everything by Æ,,: according to tastes, we shall either keep until the 
last but one minute a finite 4-volume of integration ©, or introduce conver- 
gence factors to be kept until division by Kyo is performed. 

The second trouble is the root of all evil (*). I shall start, again, from the 
perturbative expansions; it will be quite simple to see then that the method 
I propose, if it works at all, works in general. 

Since singularities in the integrand occur both whether points coincide, or 
are on the light cone of one another, I shall use the word confluecne to denote 
both situations: we shall have confluences of order 1 if a pair of variables are 
involved, of order h if h+1 variables are confluent. Clearly, the singularity 
is of a different type for different orders of confluence. 

An essential fact is that each integrand in the perturbative expansions is 
a symmetric function of its arguments &,,..., éy: this makes legitimate the 


(5) The first to understand and describe very clearly this fact was, to my know- 
ledge, J. VALATIN. I refer to his talk for further details on his views, and just take 
this opportunity to thank him for several interesting discussions on this subject. 
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free exchange of orders of integrations, which would be highly questionable 
for single Feynman graphs. 

The main problem is one of combinatorics. It is so complicated, that se- 
veral tricks are necessary to cope with it. 


3°2. An example. — Let us first consider a very simple example, which shows 
some of the main combinatorial aspects of the question without the additional 
troubles due to the occurrence of infinities. 

Replace, that is, integrations with summations in the perturbative expan- 
sion of A, ,, which becomes (omitting, for brevity, the x, y, t variables), 
(let us forget here about y-matrices): 


(15) Ky,,p,(A) = DIA IS der Aa 


aD N 


du wi ° is symmetrie in all indices, which range all over a same set of values. 


We may take Y over all N, by defining ae to be zero if N ~ N(P,). 
It is very easy to see, then, that: 


where I, 


SN 
(16) > Fe + > ee En 2. Di fi. a fetta ne 


«ini 7 
a h 


this being simply another way of counting terms. 

Confluence now is mere coincidence of indices: confluences among indices 7’ 
are still left in, but the condition i # i... î,_, isolates confluences of various 
orders in the index à, 


2. pS , can equally well be written 
(16)' sy neat 


& Deb, which means that for any given set 


lutmep dint h 


of à …, , the index è can only be given values not taken already by some 7. 

(Note that Y =)’ has a different meaning for different values of N — A). 
Rari 

Going back to the perturbative expansions, one finds, after some juggling: 


Sa 
ax ("lh at) mi 
Gres Uns o o jh i RER a RO ee ie ae 
(Gry) 07 7 h! ys DA n Yx, ie a pd te VPS I) 


where K’ means a kernel evaluated with the rule: for each 7, sum over all 
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values of indices 47. To compare with: 


= E 


(12*) > => K ie ata a 4 È, woe tn) È 
i YU, Yn, À 

We discover the remarkable and unpredictable fact, that adoption of this 
rule (K > K') changes (12)* into (17), so that all terms isolated away by this 
rule are classed in increasing order of confluence and group together again 
into kernels, whose formal properties are entirely known to us. This finding 
is the only, reason why this line of approach proves successful. 

Note that the kernels K’ still have confluences among all indices 7-7. 
Divergencies, that is, are far from being removed at this stage. But let us 
suppose that all terms with h > 1 in (17) are dropped: either because we just 
impose so, or because, as will be the case, after due changes, in electro- 
dynamics, we are able to show that their omission is equivalent to a « renor- 
malization through counter-terms » of charges and masses: to simulate the 
latter case, we write, in doing so, À instead of 2. 

(12)* is therefore substituted by 
(18) ai e E) 

07 LEURS | à 


i - Yy, 1 


But (12)* generates, in an obvious way, the perturbative expansion of A, ,. 
If we try to generate it from (18), we have to consider higher derivatives ob- 
tained by iteration from (18). Thus 


OK y,p, 0 fil; za {Uy Say Dy, à | 
me Fak (Ce) Be Goi dle et, 


#i ‘ YN Ù) 


where AK” means, with reference to a perturbative expansion, « summed over 

all indices # è, j": because è was already excluded from the summation range 
for A’, and now we have to eliminate all confluences with j (= è). 

One sees that iteration to all orders automatically gives sums of this type: 

> , that is: all terms with confluences are dropped. If such terms are 


PÉTER AE 
infinite, we have now a finite theory. 

The use of the equation for @K, ,,/04 is therefore only a convenient trick 
to avoid the simultaneous consideration of all possible disjoint sets of con- 
fluences, which would be an awful task (°). 


(*) To my knowledge, B. FERRETTI was the first to use equations of this type in 
connection with renormalization; my indebtedness to him far exceeds. though, any 
particular question of field theory, for I have received from him more instruction 
and encouragement than I can possibly thank him for. 


edi 
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Finally, we can use (16) to change 
Punk es ARE ; ANT EVA Le 
SUR "o into DAR LIDIA O 
i meet ; GTA 
which is now a prescription on how the summation must be performed: it tells 
that Y must be performed first, and over all indices not already present in 


any total or partial expansion of K. This is the analogue of taking a partie 
finie of an integral, and is the form that we shall use later. It makes the 
procedure independent of perturbative expansions. 

(16) is very significant: it shows that we can either, with our rule, perform 
an integration at a time and drop terms, or drop terms from the N-fold sum as 
a whole, obtaining the same result. In the last version, we find here the ana- 
logue of Salam’s procedure to subtract overlapping divergencies. The latter, 
however, due to the use of momentum space and other reasons (it is less 
restrictive than ours), cannot be replaced by N successive integrations with 
a subtraction rule. It is most essential that both things be equivalent, or 
else we could make no strict mathematics at all on the branching equations:, 
these, after any number of iterations, still must yield, for consistency, possible 
equations with the same solutions (if any), and this could not happen if the 
rule for renormalizing an N-fold integral were not applicable also by succes- 
Sive steps; since it is so, this will suffice to show that the procedure is inde- 
pendent of perturbative expansions. 


3'3. Renormalization. — There are infinite ways of obtaining finite results 
out of divergent integrals: the task is to show that, by so doing, one is actually 
performing renormalization of charges and masses. From the previous example 
we have learnt some combinatorial tricks, and the technique to use /-deri- 
vative equations for kernels. We have also learnt the essential rôle of the 
symmetry of integrand as a whole in its arguments, and the necessity that 
whatever rule we use, it must satisfy (16), in some form or another, to assure 
iterability of the procedure. 

There are infinite procedures which satisfy these requirements (other con- 
ditions shall be found later, that are also easily satisfied). They can take the 
most various forms: Hadamard’s partie finie, Riesz’s analytical continuation, 
Leray’s convergence factors, and as many more as you care to invent. I shall 
first give a couple of examples, then proceed with a general formalism which 
need not specify the actual prescription which is chosen. 

The simplest to exhibit—although may be not the best—is the use of con- 
vergence factors. Start again from the perturbative expansions. Each in- 
tegrand is of type 


È ì y (a + Em LAN ae E 
[fdr FO à i Bape LEON ae pacetp. | » 
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It is desirable (although not necessary) to use covariant methods. Introduce 
then for each x, y, t, € a real number &,, ..., es and multiply the integrand by 
a form factor which makes it convergent, e.g. of type: 


WOLD 4 SP 2s. Y ive base Coeli 

This is symmetric, so that the integral, due to the symmetry of the integ- 
rand, is a symmetric function of &.,..., e.,. (Divergencies arising because © 
is co may be eliminated by multiplying be exp [ D (7: —;)}|(u; — us)? |], but 

i<j 
this point we neglect here). The integral therefore is a function of 
Lo. Y to Ego Ey e Et Es Ey + Ex, Ey, ConVergent except when some e’s coincide. 
The e,, €,, € we leave arbitrary: by so doing, we shall obtain the kernel, which 
is a distribution, as the limit of a regular function which tends to the distri- 
bution when &,e,e, — 0: this is an appropriate way to introduce distributions. 
We need only consider ¢,...¢,. Let the integral (omitting explicit mention 
of #,..., €) be the symmetric function = f(£183 ... €,). 

We are now exactly in the situation in which formula (16) holds: con- 
fluence in 1 means some e= e. Suppose first that f can be expanded in a 
Laurent series in the variables (e, — €, _,): for each such variable, >} can be 
split into two parts, one being the singular part, the other the regular part 
(of which, of course, only the first term survives). The symmetry assures 
that no matter what è we start from, the expansion is always the same. Then, 
if the additional requirements we shall meet later are fulfilled, renormalization shall 
mean here: keep only the regular parts, for each (e, —¢,_,) separately. 

We can as well take from the beginning e, = 0. Once the negative powers 
of (e, — 0) are eliminated, we are left with a symmetric function of & ... €,, 
and we can start all over again. This is of course a general outline: in practice, 
one will find ways to do it better and quicker. If a Laurent expansion is not 
possible (we do not know), the thing works as well: in any case, it will be 
possible to decide, without ambiguity, how to split into regular and singular 
parts in (e; — &;). 

The proof of this statement reduces to showing that something like for- 
mula (16) holds. Consider only confluences in ¢,, and assume Laurent-ex- 
pandibility (4, symmetric) : 


(1€ ... Ex) = [ +2) [ > + Da [(e° — €). [(ey— e,) 2 = 


-[3_ FRE 


has hN 20 hg... hvy2>0 h=- ®© peohN>O0 hyly=- 
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due to the symmetry, which is (16). The overlapping terms split away quite 
—1 
naturally. {(N —1) > Y stand of course for: 


hgz-hN2>0 h,=- 


> > Ann (E: — E1)}e{(Ex —E1)} + {(es E1)}"{ (Ei ey €) 


hg. ANO h,<0 


etc., which is the same: after all splittings are performed, all e°s will tend to 
zero, and the numerical coefficients come out right. } The only term that sur- 


vives after elimination of e, confluences is Y > 0. 
Rss AN 
A better procedure may be to consider the integrand as a function of 


(x; — &;)? (the yo terms coming out of the Feynman fermion free propagators 
can be disposed of quite simply—we shall not go into this). Then, replace 
each (x;— x;)? by (#;— #,;)2?— i(e;— €;)}, and procede as above: this is one 
of Hadamard’s prescriptions for taking the partie finie. This has the advantage 
of changing the free propagators, which are distributions, into honest and 
simple analytic functions: the same process that splits off the infinities secures 
too that the proper limits which change the analytic functions into the distri- 
butions are taken (1). 

In conclusion, it is better to study the question in general, without spe- 
cifying the rule, except that it must produce something like (16). 


34. Renormalization continued. — We can now extend the results obtained 
for our example to the general case. Write, whatever the explicit rule we 
decide to adopt: 


(19) Jasfas [AE FE, …… En) - fas] ae. Su ai|-.| Las, + ar PIE, … Ew) ; 


n 1 
where 


(20) fes = [as =“ di ’ 


1 


denotes that in performing the È integration all, and only, the divergent parts 

if any, which arise because &, is confluent with &€, are eliminated: a « partie 

finie » rule, formally, and the corresponding divergence is called diF(£, ... &y) 

The prescription will be such (as is actually the case with the previous examples, 

due to the symmetry of the integrand) that [dé,d/= difdé,, ete. This is a 
1 1 


requirement that need be satisfied only after all limits are taken (that is for 


infinite d', etc.). 


(1°) This has been suggested by A. S. Wicurman, to whom also I am greatly 
indebted for many enlightening discussions. 
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We find, therefore, the counterpart of (17): 


oK Y oe Ly &&, ce da | ) 
‘ Ky, i © È Hi nag yh i! > Elisa ,t wood 
(21) Sh i dé ey ds: d:y pl. Y K ss. la Oe EE, ee Er... Sees h 


which still contains divergencies, but not those divergencies arising from con- 
fluences with £. (21) is to be compared with 


(12) Son = (de > PK (- 


Tr 
LE 


The next step requires some complicated combinatorics. In (21), the A’ 
contain, if expanded, infinite elements: (£;£,) ete. We must take all such things 
away, so to have them acted upon directly by the di operators. Some com- 
binatorial theorems must be invented, after which one finds: 


oe e À d'y h+1 Chk+1)/-] feos ex. 
en a SH fae da ayy. ZIE gg) 


h=0 r=0 


= STE eee È 
4 [È eee Er 
hh; Eno” se. È,0” Vi eee Yn, 


where: r+s=20+o=h-+1, En, : CS AE EE -< É,; are a permutation of 
EE … &, ete. The s mean (ab) = (i b) — (ab); (è b)=0 (same for [ ]), and 


kernels are defined this way. 


The next step is an evaluation of the maximum infinities that may (but 
need not), occur, knowing the type of the singularities of (wy) and [xy]. Sin- 
gularities are worst for coincidence rather than confluence: then (xy) is sin- 
gular like e, [vy] like e. Each singularity appears in a 4-fold integral, so 
that each integration (which we may restrict, for this purpose, to a 4-volume 
of dimension ¢* around the light-cone of È) gives a factor et, In conclusion, 
divergencies for h-fold confluence occur only if 


lim e'*-/"-°0 20 


eo) 


(a finite value #0 might give a logarithmic divergence) or, in terms of s 
and o, if: 


b b b 
(23) ie +5o<h(t+5—4) +443. 


In electrodynamics, f= 3, b= 2, so that we find that the condition for 
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actual occurrence of divergencies in (22) is: 
(24) 38 +o<4, 


which is the standard result of all authors. 
One needs therefore only consider: 


Het eda 8=0, 
h odd 

[| = 9 5 ss 1, 

cee F s=1, 
h even 

[od 3 SEO 


The case c=1, 3, s=1(h even) is immediately seen to be out, due to 
Furry’s theorem. We also neglect here co = 4, s= 0. Such terms could be 
easily taken care of, in any case. We retain therefore only the terms: 


(010) h odd: 8 —0, =0: K és | | ; 


| 
(0 2) » § O — 2: K | Al 
(110) » Seti = 0 k(: SE 


(111) h even: E | nie (Be 


After some heavy combinatorics, we find again that some magic is at work. 
First, define some new, totally divergent «kernels», or better « cores of 
the divergencies »: 


Bene | so 7 2° 
DI Rig ee eet niet al ee AIR OSE 
eu (7 … Ye st o PRE I DARA 


(3 TI RIA 


ge Caio tien 
Jim ECs ors Es 56 és] 


Then, use (25): 
: E È di æ é 
(0|0) leads only tofdéy"L (: E), (110) leads tof déÉL ES E|, ete. 


Finally, one finds that necessary and sufficient conditions for the prescription 
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adopted to be a renormalization are that: 


Joss È di e = a(A)- (cy) + p(s ry) 


y Ë aj 
CASSE ua [xy] 
(26) fos È | 5 #)= y(4)- [zy] + 3(4) (m3) ’ 


fost (È AE i) = (1 fante y ten : 


È | 
ax(2) =| aer{ e), 


one has then, after some more work: 


Calling 


(27) [1 — Aa(A) — 42y(4) — e(2)] =. Fe [ao(4) + No(4) + 3Poy(4)]Kx,r + 


à ê » =. x Ly Ca 
— B(A) Sn. er VTT (2) ê(m} Er, = [ass abe «ve YN, È poss à ct) 


At r.h.s. of (27), the same step has been accomplished, that changed, in the 
example, Y K' into >" K (here: [dé >[dé). 
1 


It is at this point very easy to see that, if we consider 7, m,, m, as functions 
of new parameters À, m,, My: 
À =A(d), 
(28) m, = mM, — Sm,(A) 
mi = m;— Smi(7), 
and set: 
‘ ” y > A a 2 À > pee 
(29) Ky,p(A, Ms M) = AZPZiK (A, m,, M), A=A(A)= 222s 


we have 


A = - 
AK y,,p,(A, m , My) N DICI 
à £ b = AR" È È a , 


provided A, Z,Z,Z,, 1(2), satisty equations in À which are easily written and 


(30) 
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solved, formally, in terms of «(A),..., e(2). Thus: 


A 


AA) = à dA with f(A) = 1— Aa(A) et) Sy 
” (4) f -| res Pi 


0 


This shows that the terms dropped can be identified with (infinite, in a 
theory with divergencies) standard renormalizations. 


835. Independence from perturbative expansions. 


a) First, the argument used for the example shows that (30), used to 
generate the perturbative expansions, yields everything renormalized. 

We conclude that what we have done shows simply that if one changes 
fag, a dey into [dé, a fas, renormalization obtains automatically. 

To prove that this fact has nothing to do with perturbation theory, one 
has simply to start with (30), in which [dé is changed into il dé: iterations lead, 
in virtue of the branching equations, immediately back to (27). The same 
is true in all possible branching equations. Anything that the change {ag os | dé 
adds is of the form that counterterms would produce. 

The renormalized theory obtains from the unrenormalized one by changing 
every where | dé with if dé. 


b) Eq.’s (26), in a covariant theory, are a simple consequence of the 
Lorentz-invariant character of the definition of [dé À 
The proof is of the standard type, and need not be reported here. 


c) Provided one makes sure that a prescription satisfies (26), it can be 
applied to any non-covariant theory, or to such things as Bethe-Salpeter’s 
equations, ete. 


d) We have essentially shown how regularization must be handled so 
that it becomes renormalization. For the so-called non-renormalizable theories, 
final results may depend upon the prescription adopted: we have only a re- 
gularization procedure. One may ask whether the condition that the resulting 
theory by unitary, or some such thing, may suffice to render the prescription 
unambiguous: if it were so, all theories, or some of them at least, would be- 
come «renormalizable ». This is a bare chance, please do not take it as a 
conjecture. 


e) In conclusion, the theory is now on a well defined mathematical basis: 
things make sense, existence can be investigated. From a «practical » point 
of view, I believe that many computations should become far simpler with 
these new techniques than they have been in the past. 


x. 2, 1958 
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1. — Introduction. 


According to the wave theory of light the relative intensity distribution 
inside an interference or diffraction pattern is independent of the intensity 
of the light source, which produces the pattern. 

When, at the beginning of the century the corpuscular nature of light was 
reintroduced in its modern version, some doubts arose, whether interference 
patterns for low intensities are still the same as for higher intensities. 

Among others J. J. THOMSON (see preface of G. I. TAyLor’s paper) put 
the question whether a diffraction image produced by an extremely low in- 
tensity of light showed the same relative distribution of light as a pattern 
produced by a source of normal intensity. On the suggestion of J. J. THOMSON, 
G. I. TAYLOR (1) carried out experiments on diffraction patterns produced by 
different intensities of light and came to conclude that the interference pheno- 
mena are independent of absolute intensity. Later A. J. DEMPSTER and H. F. 
BATHO (?) carried out experiments in somewhat greater detail, using an echelon 
and they also came to the conclusion that the intensity pattern does not depend 
on the intensity of the light producing the pattern. 


(1) G. I. TayLor: Proc. Camb. Phil. Soc., 15, 114 (1909). 
(?) A. J. DEMPSTER and H. F. Baruo: Phys. Rev., 30, 644 (1927). 
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It must be emphasized that from quantum theory it follows that intensity 
patterns do not depend on intensity of light and there are no difficulties in 
interpreting the interference phenomena at arbitrary low intensities as long 
‘as we are satisfied with a purely mathematical description of the phenomena 
and do not try to picture in detail the process. If we try, however, to do so 
we come to paradoxical conclusions, as was discussed, e.g. by BORN (3), 
JORDAN (4), DIRAC (5) and many others. 

Regarding these paradoxes many physicists accept the point of view that 
it is sufficient to give a mathematically consistent picture of micro-phenomena 
and that it is unnecessary to try to give a picture of what is happening in detail. 
Other physicists, e.g. SCHRODINGER (°), regard this state of affairs as unsatis- 
factory and take the impossibility of interpreting the mathematical formalism 
as the «sign of a crisis ». 

It is seen that on the theoretical side there is no complete agreement as 
to the interpretation of the wave-particle phenomenon. Furthermore, a de- 
tailed scrutiny of the experiments of G. I. TAYLOR and of DEMPSTER and BATHO 
has shown that the results of these experiments are less convincing than usually 
assumed (7), therefore it seemed to be worth-while to carry out new experi- 
ments and to investigate the question anew. Such new experiments seemed 
the more worthwhile since making use of modern methods of counting single 
photons, the experiments could be carried out in a manner far more direct 
than was possible formerly. 

Using the photon counting technique we investigated the interference 
phenomena with a Michelson interferometer of an arm length of 10 em (°). 
The results of this experiment showed that there is no difference between the 
intensity distributions of the high and low intensity interference patterns. 
In the following section we should like to make a few additional remarks on 
the problem of low intensity interference phenomena. e 

A photon might be assumed to be a packet of length comparable with 
the coherence length of a spectral line. If the dimensions of the (e.g. Michelson 


type) interferometer are much smaller than the «size » of the photon, then 


the interference phenomenon might be accounted for by picturing that the 
photon floods the whole of the arrangement simultaneously and thus comes 
under the influence of the whole of the optical arrangement and, in particular, 
under the influence of the two mirrors at the ends of the arms. According to 


M. Born: Optik (Berlin, 1933), p. 465. 

P. JORDAN: Anschauliche Quantentheorie (Berlin, 1936), p. 14. 

P. A. M. Drrac: Principles of Quantum Mechanics (Oxford, 1955), p. 7. 

E. SCHRÔDINGER: Rencontres Internationales de Genève (1952). 

L. Jinossy and Zs. NAray: Magy. Fiz. F., 6, 105 (1958), (in Hungarian). 
L. JANossy and Zs. NAray: Acta Phys. Hung., 7, 473 (1957). 
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the measurements of LUMMER and GEHRKE (*) the coherence length (e.g. in 
the case of the green mercury line) can be assumed to be of the order of 1 m. 
This length is indeed larger than the dimensions of the interferometer used in 
our first experiment (*). For this reason we decided to repeat the interference 
experiment with an interferometer with arms long compared with the coherence 
length; thus we have carried out measurements with an interferometer with 
about 14 m long arms. Such an interferometer is appreciably larger than the 
coherence length of the photons, therefore, regarding it from the classical 
point. of view, a wave packet about 1m long, when falling on the optical 
splitting device (¢.g. on the semi-transparent mirror of the Michelson inter- 
ferometer) has to break up into two parts, each travelling along one of the 
arms; after reflection these two parts unite again. The difference of path 
lengths along the two arms is of course small and thus the two parts of the 
wave packet when considered classically unite in corresponding phase and 
give rise to interference. 


2. — The instrument. 


In carrying out the experiments it was necessary to draw a limit between 
what may be called a small intensity and what may be called a large intensity. 
No such distinction can be made on the basis of either quantum mechanies 
or pure classical physics as according to both of them the phenomena are 
strictly linear. However, from a primitive corpuscular aspect of light we may 
interpret as small intensity one for which on the average less than one photon is 
contained at any time inside the apparatus. Alternatively, if we picture the 
photons as light impulses containing many wavelengths and being of the length 
of the optical coherence length, then we may denote an intensity as small 
provided the lengths of the single wave packets about equal the coherence 
length and do not appreciably overlap. 

Although these criteria for small intensities have no particular significance 
according to the mathematical formalism of the accepted theories, nevertheless, 
they give a limit such that for intensities below that limit the paradoxes of 
interpretation discussed in the preceding article (7) arise. 

The optical path U of a photon in our interferometer is of the order of 
twice the arm length, J, i.e. in our case where 1 = 13.85 m 


l~2l1~30 m. 


Thus the condition for only a few photons to be contained simultaneously 


(*) G. LuMMER and N. GEHRKE: Verh. Deutsch. Phys. Ges., 4, 337 (1902). 
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inside the arrangement is that the intensity N expressed in photons per second 


should be 


Cero 1020 


ERA 
RETE 


= 107 photons/s . 


Taking the second criterion, i.e. that the «photon bands» should not ap- 
preciably overlap we may require for a small intensity N that 


(6) 
CN Te A 


, ~3:-10* photons/s , 


where A is the coherence length, the order of magnitude of which we may 


assume to be — 1 m. 


We carried out 


measurements with 


Michelson interferometer with 


1— 13.85 m arm length counting the photons in the pattern JP by means of 


photomultiplier P (Fig. 1). The inter- 
ference pattern consisted of vertical frin- 
ges. The fringes were projected through 
a vertical slit onto the cathode of the 
photomultiplier. The multiplier together 
with the slit was moved by small steps 
along a horizontal path and thus the rate 
of incident photons was counted in various 
points of the fringe system. In this way 
we determined the intensity distribution 
in the interference pattern both for high 
and low overall intensity. The exper- 
imental procedure and the evaluation of 
the results were done in a way similar to 
those described in a former paper dealing 
with the small interferometer (°). 

So as to get reliable results great pre- 
cautions had to be taken. In order to 
measure out an interference pattern point 
by point a measuring time of about an 
hour was required. During this period the 
pattern must not move by any appreciable 
fraction of the fringe width as otherwise 
the patterns obtained for high and low 
intensities cannot be compared quanti- 
tatively. At the same time the low inten- 


Fig. 1. — Arrangement of the meas- 
The incoming beam By 


uring set. 
is split by the semi-transparent 
mirror P,, into beams B, and B,. 
The beams B, and B, reflected by 
mirrors M, and M,, respectively. 
form in the plane of a slit S of the 
multiplier P the interference pat- 
tern IP. This interference pattern 
is counted out step-by-step by mo- 
ving the multiplier P. The arm 
length of the interferometer was 
1=13.85 m. 
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sity pattern is of such low intensity as to be practically invisible and therefore 
no check can be made while the measurement is in progress as to the con- 
stancy of the pattern. We note that so as to avoid a fringe shift of 1/10 of a 
fringe the arms must remain constant to an amount of the order of 2/20. A 
lengthening of 2/20 of an arm might be caused by a temperature change of 
about 10-5 °C. 

So as to achieve the necessary stability the apparatus was built in an 
underground tunnel cut in rock 30 m under the surface of the earth. The 
length of the tunnel was 20 m and its width 2.5 m. The interferometer was 
mounted directly on the concrete floor of the tunnel. 

The tunnel appeared to be sufficiently stable both regarding temperature 
and mechanical disturbances so as to be suitable for the measurements. How- 
ever, the manipulations with the arrangement disturbed the interferometer 
to such an extent that it was found necessary to operate the arrangement 
entirely by remote control from a different compartment. It was found that 
after all necessary adjustments had been carried out it was necessary to leave 
the arrangement for a few days to rest. After such a period all mechanical 
tensions relaxed and temperature equilibrium was obtained and using the fully 
automatized gadgets successful series of measurements could be carried out. 
The automatics is described in greater detail elsewhere (°). 

Because of the great sensitivity of the interferometer to any changes in 
the temperature the dark current of the multipliers could not be reduced by 
cooling Cooling would have been of disadvantage among other reasons also 
because it reduces the sensitivity of the multipliers (7°). Particular precautions 
had to be taken so as to reduce the dark current pulses of the multipliers 
without reducing the sensitivity. The dark current pulses of the multipliers 
could be reduced without cooling by methods described elsewhere (11-18), 

While the arrangement was locked up in the tunnel the pattern could be 
checked from outside by help of a periscope inserted by means of a mecha- 
nical manipulator. With help of the manipulator the intensity could be in- 
creased to such an extent that visual observation or photographic recording 
was rendered possible. If the interference pattern seen through the periscope 
showed any change the mirrors could be slightly adjusted by means of another 
manipulator so as to bring the pattern back to normal. From time to time we 


also took photographs of the pattern seen through the periscope. 


The block diagram of the arrangement is shown in Fig. 2. The light 
is obtained from a high-pressure mercury discharge tube L. The light passes 


) Zs. NARAY: Ann. d. Phys. 20, 386 (1957). 

) Zs. NARAY: Acta Phys. Hung., 5, 159 (1955). 

(12) Zs. NARAY and P. VarGa: Brit. Journ. of Appl. Phys., 8, 377 (1957). 
) Zs. NARAY: Journ. Sci. Instr., 88, 476 (1956). 
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through an optical gadget which can be used to reduce the light intensity by 
factors equal to powers of ten (!*). The reduced light beam is passed through 
the monochromator M, which selects the line 2 — 5461 À. The monochromatie 
beam is focused by means of the lens L, on to the slit S of the interferometer. 


ELECTRONIC COUNTER (EC) 
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eee eel 13.850 (LA) \/ 


Fig. 2. — Block diagram of the experimental arrangement. For a description see Sect. 2. 


A second lens L, produces a nearly parallel beam B,; which falls on the 
splitting prism P,, and is divided into two coherent components B, and B,. 
As we had not enough room to let both beams B, and B, travel a distance 
of 14 m in one direction we turned the reflected beam by means of a mirror M,, 
through an angle of 90° and let it move backwards along the direction of the 
incoming beam; similarly we turned the transmitted beam by means of two 
mirrorsM,, and M, by twice 90° and thus both beams B, and B, traveled 
most of their paths in the direction of the tunnel into which the whole arran- 
gement was mounted. The beams B, and B, are returned by the mirrors M, 
and M,. Thus the beams return to the prism P,,. The transmitted part 
of B, and the reflected part of B, pass through the telescope T and produce 
an interference pattern JP in the plane of the slit S of the photomultiplier. 
The photomultiplier together with the slit could be moved in a direction per- 


(14) L. JANossy and Zs. NARAY: Acta Phys. Hung., 5, 153 (1955). 
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pendicular to the direction of the fringes and thus the intensity distribution 
could be determined by counting the rates of photons in various parts of the 
pattern. 

In the course of the measurements a beam of about 10% photons/s falling 
on the slit S, represented the large intensity. The absolute measurement of 
this intensity was performed by a thermopile (15). To obtain the small in- 
tensity we placed an optical filter F, close behind the slit S,, which reduced 
the intensity by a factor 104, thus the small intensity amounted to about 
10: photons/s, the latter intensity can be regarded as small taking either of 
the criteria of Sect. 2. 


When measuring with the large intensity it was found convenient 
to place filter F, close before the slit S of the multiplier. The filter F, reduced 
the light intensity, similar to the filter F,, by a factor 104. The introduction 
of the filter 7, was found convenient as by means of this filter the large in- 
tensity—after the coherent beams had been mixed—could be reduced to the 
same intensity as is obtained from the small intensity beam falling into the 
interferometer. This reduction of the strong beam just before entering the 
multiplier is irrelevant from the point of view of the principle of the experi- 
ment. Indeed, if the filter F, is placed before the input of the interferometer, 
then weak beams of light interfere and produce a pattern. If, on the other 
hand, the filter F, is introduced before the slit we have strong beams of light 
interfering and producing a pattern, the filter 7, merely not allowing all the 
photons of the pattern to pass on to the cathode of the multiplier. 

The advantage of using the filter F, is, that in this way the multiplier is 
exposed both in case of large and small input intensities to approximately 
equal intensities of light. Thus overloading of the multiplier is avoided and 
a stable performance rendered possible. 

The interference pattern obtained covered a dise of 5 mm diameter. We 
used a square area of about 2.0mm side length, one side being parallel to 
the slit and so to the fringes. The slit was 0.2 mm wide and was moved in 
steps of 0.25 mm each across the pattern. We counted in each position for 10 s. 

Denote by x the distance of the slit from the vertical symmetry axis of 
the square and v'(x) the number of photons counted if the interference pattern 
is produced by low intensity with the slit in position x during 10s. Similarly, 
denote by »’(æ) the number of photons counted under similar conditions for 
high intensity interference. As in the latter case the light is reduced, after 
interference has taken place, by the filter F,, the numbers y'(«#) and v"(«) are 
expected to be about equal. 


(5) Zs. NARAY and M. Tor: Magy. Fiz. F., 5, 7 (1957) (in Hungarian). 
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So as to be able to compare the two intensity distributions the background 
count for each individual reading had to be determined. This was done by 
introducing shutters S, and S, (see Fig. 2) into the beams 5, and B, after each 
reading. The count thus obtained, with the beams of the interferometer 
blocked, were due to stray light and to the dark current pulses of the mul- 
tiplier. Denoting by »(x) and »,(x) the counts obtained under conditions si- 
milar to those under which »'(x) and v"(x) were obtained, except for the shutters 
S, and S, being closed during the measurements of the former, we have for 
the counts caused by the interference pattern itself 


LÀ 


p'{æ) = v'(œ) — (a), p'(e) = v'(x) — (2) 


for low and high intensities, respectively. 
So as to compare the intensity distributions with high and low input in- 
tensities, one might consider the ratio 


p'(x) 
L A(x) = ——., 
(1) (@) = 
The theoretical prediction is that 
A(#) = const. 


If we assume the filters F, and F, to be exactly equal, we expect 
(2) A(x) = 1. 


However, the relation (2) is not very suitable for à precise checking of the 
theory. Because of the optical faults of the system the functions p'(x) and p"(x) 
are not exactly sinusoidal. Computing thus A(x) from (1) we have roughly 
speaking to judge whether 

A(®,..) equal or not equal A(x.), 

Lor nin are the co-ordinates of the interference maximum and mi- 
nimum. However, the determination of A(,,,) is necessarily inaccurate, firstly 
because the counts p'(%,,,) and p"(%,.,) are in any case small, secondly be- 
cause the minima of the curves are not very well defined and are subject to 
even greater relative fluctuations caused by outside disturbances than the 
points of higher intensity. So as to get quantities which are more suitable 
for such a comparison, we have inserted after each period of measurement 
of v'(x) and v(0) a measurement with one of the shutters, e.g. S, open and 


where x__ and # 


MS Ro En le 2. 
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the shutter S, closed. Denoting the result of the measurement by v,(z) and 
similarly by »;(x) the corresponding measurement at conditions of high in- 
tensity, then 


p' (x) =) — ne) and  p"(x) = »,(x) — v(x) 


are the light intensities reaching the multiplier along one arm of the inter- 
ferometer with the other arm blocked. Thus 2p'(x) and 2p”(x) represent the 
average intensities of the interference field; i.e. the intensities which would 
be obtained if the two beams were to add up their intensities without giving 
rise to interference. The differences 


p'(æ)—2p'(x) and — p"(x) — 2p"(a) 


give thus the effect of interference on the intensity in the point x. In parti- 
cular the ratios 
p' (x) — 2p'(x) 


d'(æ) = 7 2p'(x) and a" (a) = ap") , 


give the relative intensity variation inside the pattern. If the interference 
pattern would fade out at low intensities as could be anticipated in terms 
of some model, we would expect è'(x) to differ from 7”(x); i.e. à’ (x) would show 
a variation less pronounced than ?(x), and if the pattern would fade out 
altogether at a very low intensity 7(#) would approach a constant at that 
low intensity. On the contrary if the pattern does not depend on intensity 
d'(æ) and è"(x) do not differ. The comparison of 7’(x) and 7’(x) is advantageous 
as these values should be expected to be equal regardless of whether or not the 
filters F, and F, are exactly equal; furthermore, these values at the intensity 
minimum and maximum are statistically about equally well defined, thus the 
comparison of è(x) and ?'(x) for «=%,n and æ — +, gives a statistically 
good criterion as to whether the interference pattern is not independent of 
intensity. 


During the series of measurements we had to carry out constant checks 
as to the stability of the arrangement. Among others, we introduced a mirror 
M,, into the beam before it entered into the interferometer (Fig. 2) splitting 
off by means of this mirror a constant fraction of the intensity. The inten- 
sities of this split-off beam were constantly recorded by a multiplier P*. We 
rejected all those readings where the overall intensity indicated by P* differed 
from an average value. 


D RSR eee P y . 
a er tetas fs O - 
> i pedi ok DAT 4 
vas à i 


INVESTIGATION INTO INTERFERENCE PHENOMENA ETC. 597 


The photomultipliers were fed from a stabilized high tension set constructed 
specially for the purpose (1°). Furthermore, we took care to keep the multipliers 
all the time in darkness. Details can be found in other publications (19-18). 


8. — The results of measurements. 


We carried out 12 runs of measurements and thus obtained 12 sets of 
determinations of the relative intensity variations #(x) and i’(#). The results 
of one of the series are shown in Table I and Fig. 3. We note that the average 


TABLE I. 

x Dv) v (a) mix) | vi(e) vo(1) vo(1) 
| | 
MOT 1.00 6550 13 350 5050. | 4850 1 600 1 600 
ie 0275 8 200 13 900 4700 4 850 1 600 1 600 
meee 05) 015050! 10150 4950 4600 1550 1600 
| —0.25 15100 13 300 4950 | 4850 1 600 1 550 
| 0 7 000 7 000 5000 | 4800 1600 1550 
| 0.25 | 1 800 2250 4750 4600 1550 1550 
| 0.5 6000 | 7250 4800 | 4400 1 600 1550 
| 0.75 12 550 6 000 4550 4000 1550 1650 
Be 1:00 13400 | 9 400 3750. | 3750 1 500 1 600 


intensity 2p(x) was of the order of 6-104 photons/s, while »(#) was of the 
order of 1,6-10* photons/s. 


i'(x), ix) 


Fig. 3.— Relative intensity variation of the interference pattern in a typical measuring run. 


(15) Zs. NARay and K. ZSDANSZKY: Zurnal Priborii i Technika Experimenta, 


108 (1956). 
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The curves Ÿ(x) and (x) do not show any systematic deviations. An 
exact analysis as to the largest deviations between 1'(x) and 1"(#) which would 
be still compatible with the errors of measurement could not be carried out 
for the present measurements. It seems, however, that any systematic de- 
viation greater than (10-20) % is unlikely. 

The quantitative analysis of the possible deviations between 2'(x) and 
i"(x) would require rather more material than we had obtained, when owing 
to outside circumstances the measurements had to be interrupted. We did 
not think it worth-while to restart the measurements at a later period as the 
results as they are confirm very well, even if only qualitatively, the expec- 
tation of theory for such small intensities of light where the paradoxes described 
in another article hold. As the result of our measurements we have therefore 
to conclude that interference phenomena are perfectly normal even at such low 
intensities where at one time in average less than one photon is to be found in the 
arrangement; this is true even if the dimensions of the arrangement greatly exceed 
the coherence length of the photons giving rise to the patterns. 


eee 


We express our thanks to Mssr. K. TITscHKA, J. KMETY6 and J. FÜRJES 
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1. — Introduction. 


We usually expand the field variable in terms of the suitable ortho-nor- 
malized complete set, and quantize every mode as independent harmonic oscil- 
lator. This method has its base on the formal correspondence to particle dyna- 
mics, and all modified methods of quantization (1) are proved to be equivalent 
to this original one. Field theory is now very tightly formulated in the so- 
called Lagrangian formalism, and, at first sight, there seems to be no room 
to modify the theory or to introduce new concepts into the scheme. The 
Lagrangian formalism is excellent in the point that it automatically warrants 
the internal consistence between wave equations, commutation relations, 
Schrôdinger equation, invariance and conservation laws, ete. (?). On the other 


(*) The text of this article is translated from the Soryushiron-kenkyu (mimeo- 
graphed circular in Japanese), 14, 599 (1957). 

(1) There are many modified methods, but we cite only the recent works. 
R. P. Feynman: Phys. Rev., 76, 749 (1949); P. T. Marruews and A. SALAM: Nuovo 
Cimento, 1, 120 (1955); R. E. PEIERLS: Proc. Roy. Soc., 214, 143 (1952); M. CINI: 
Nuovo Cimento, 9, 501 (1952); F. CoEsTER: Phys. Rev., 95. 1318 (1954). 

(2) See, for example, Y. NAMBU: Prog. Theor. Phys., 7, 131 (1952); J. SCHWINGER: 
Phys. Rev., 82, 914 (1951). 
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hand, it is necessary to improve the theory in any sense, because we are worried 
about the appearance of divergence difficulties in the present formalism. The 
renormalization method is limited not only in practical uses, but also in formal 
aspects (3). The theory of relativistic cut-off had been investigated by many 
authors (*), but we could not find any consistent method. 

The improvement, which is required at this stage of field theory, seems 
not to be the merely formal modification of the formalism or the more precise 
treatment of mathematical features, but it looks better to introduce any new 
physical image. For this purpose, it will be necessary to understand the 
structure of field theory as physically and intuitively as possible, before we 
consider what should be introduced. The investigation from such a point of 
view will make clear the physical significance of any modification of some 
parts of the scheme, and what kind of physical concepts can be introduced. 
It will serve also in order to get the foresight about the general situation of 
future problems and to understand the limit of some standpoint. 

Field theory is now arranged in a beautiful form, but, just for this reason, 
it becomes rather difficult to understand the physical image included in the 
formalism. It seems more fruitful to analyse the structure of field theory not 
from the formal aspects, but from the point of view of physical correspondence. 
In this article, we investigate the structure of field theory on the ground of 
the concept of causality, that was settled by STUCKELBERG (5). STUCKELBERG 
and his coworkers attempted to formulate the propagation of classical waves 
‘in the quantum theoretical mode, and, there, the formal correspondence to 
the quantum mechanics of particles was not used explicitly. This formulation 
is very similar to the Feynman’s relativistic formulation of field theory (1), 
but their standpoint has merits, which are not included in the latter, 

We can understand the situation that the fundamental assumptions to 
construct S-matrix are Lorentz invariance, unitarity and causality, if we take 
into account the causality in the sense of STÜCKELBERG. It becomes clear 
what the causality requirement imposes on the physical system, so we can 
get an intuitive image on the structure of S-matrix. Further, it was disclosed 
that the so-called renormalization procedure could be connected with the 
causal ambiguity, which was due to the insufficiency of the physical corres- 
pondence. It is very interesting that there remains such an arbitrariness yet, 
even though the present field theory seems to be constructed very tightly. 


(*) T. D. LEE: Phys. Rev., 95, 1329 (1954); L. D, LANDAU: Niels Bohr and the 
. Development of Physics (London, 1955), p. 52; G. KALLÉN and W. Pautr: Dan. Mat. 
Fys. Medd., 30, No. 7 (1955). 

(4) €. BLocH: Dan. Mat. Fys. Medd., 27, No. 8 (1952); C. HAYASHI: Prog. Theor. 
Phys., 10, 533 (1953); W. PauLi: Nuovo Cimento, 10, 648 (1953); E. C. G. STÜCKEL- 
BERG and G. WANDERS: Helv. Phys. Acta, 27, 667 (1954). 

(5) E. C. G. STÜCKELBERG and D. RIviER: Helv. Phys. Acta, 28, 215 (1950). 
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Of course, nothing is actively extracted from this arbitrariness, about what 
we must introduce. But, we can take out a standpoint to make the new phy- 
sical image that fills up the insufficiency of the correspondence. 

We start our survey from the explanation of Stückelberg’s correspon- 
dence-theoretical interpretation of causality, and of Fierz’s interpretation 
from the standpoint of the corpuscular aspect (Sect. 2). In Sect. 3, we analyse 
the causal character of the so-called vacuum-7-funetion, and, there, it will 
be studied what the causality requirement imposes on the physical system. 
The causal character of S-matrix will be made clear in Sect. 5. The insuffi- 
ciency of the causal correspondence to the classical wave image, and the re- 
lation between the causal ambiguity and the renormalization procedure will 
be taken up in Sect. 6. In Sect. 8, we propose a direction to modify the field 
theory, in accordance with our analysis. The connection between Stiickel- 
berg’s causality and the so-called microcausality also will be explained there. 
The causal aspect used in the dispersion relation will be discussed in Sect. 7. 
Appendices are devoted to the discussions about the limit of the theory of 
non-local interaction, from the standpoint of S-matrix formalism. 


2. — Causal function. 


STUCKELBERG (°) expresses the causality in field theory as the fact that 
action is carried by a wave of positive frquency which propagates from past to 
future. This is a correspondence from the standpoint of wave aspect. 

We consider a scalar field, as an example, to interpret how we can for- 
mulate this ee We define a At+-function constructed by an ortho-normalized 
complete set (*) {w,} in the following way, 


(2.1) i At(a, y): ae take 


This A+-function has the property, 


3 
(2.2) | d'y A+ (2, y) — Up (y) = Ug (2) , 
0Yo 


(*) This set needs not to be the plane wave. The ortho-normalization relation is 
given by 


3 

—i} d3 — W3(%) = dp» 

il ru, (x) ns" | ) B 

here 
2 9 of 
g 

— HE ana x 

f(x) an, (x) ( Ae x, (x) 
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for the arbitrary function w, belonging to the set {u,}. Namely, if we have 
a wave with positive frequency at the time y,, the A*(«, y)-function brings 
this wave to the time a, without altering the configuration. So, we can take 
At(x, y) as the causal propagator A(x, y), if 4 > yo. A*(æ, y) is equal to its 
retarded part, A,(x, y), there. The corresponding propagation function is 
D,..(®, y) in the classical Maxwell solution, and the quantum theoretical feature 
is here introduced by taking only the positive frequency part of it. D,, is 


not zero only on the future light cone, but D}, has its non-zero value near 


ret 
the light cone too. 

Next, we must extend the definition of this /1°(x, y) function to the case 
Yo < %, Without disturbing relativistic covariance. For this purpose, we take 


into account the fact that, if x, > %, A*(æ, y) function can be expressed as 


(2.3) A* (a, y) = SA (2, y) i A(x, y)} (Lo > Yo)» 


li 


by the two independent (symmetric and antisymmetric) solutions of the homo- 
geneous wave equation, 


A, Fa o (4%, î Ae ife A.) ? 


A Fi At SÉ Ai, + Ax GE À,, ai 
Here, the symbol ~ means that the left hand side of this equation is equal 
to the right hand side under the condition 4, >%. We here assume that 
4°(x, y) always can be expressed by a linear combination of 4,- and A-function, 
that is 


i 
A°(2,y) = 5{4i(2, y) + a A(&, y)} = 


role 


{Ci + a) Ah + (i— a) 45 + (+ a) An — (i + a) Ag}. 


If 2 > yo, it must be a=—i, as was shown above. In the case a <%, 
the effective parts are Ai. If A! remains there, it is not consistent with 
Stückelberg’s proposal mentioned in the beginning. So, we must put a=i 
for x < y, in order to take off this contribution. Thus, we obtain corres- 
pondence-theoretically 


(2.4) A (w, y) = Ad (0, y) + An (2, y) = Ala, y) += Ai(a, y), 


if % Yo. Here 
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When 4 = Yo (more generally, when the points « and y lie in a space-like 
relation with each other), we cannot define the A°-function by the correspondence to 
the classical wave character. This is the essential point for the discussions in 
Sects. 5 and 7. 

Taking into account the identity, 


A (a) = A(— x) ’ 


we get again a relation similar to eq. (2.2) 


<> 


‘ A Carne 
(2.5) foun ar A (x, y) = Uj (x). 


This equation means that the wave u;(x) with positive frequency at t= y% 
comes from the same configuration uz at t= Lo(< Yo). 

We emphasized above that A‘(#, y) cannot. be defined for 4 =% by the 
correspondence theory. We are interested in what character this undefinedness 
possesses. On the other hand, in this correspondence-theoretical scheme, the 
quantum theoretical character is introduced by the fact that the D°-function has 
non-zero value also near the light cone, on the contrary to the D,.,-function. 
How can we connect this character to the corpuscular image? For these ques- 
tions, Fierz’s consideration (*) gives an intuitive image. 

Now, let us take the Moller scattering of electrons as an example, following 
to FIERZ. The matrix element for this process is given by 


(2.6) fare] y px) y, p(x) D(£ — y) p(y)y, y (y) - 


Here, we restrict the emission- and absorption-points x and y in the suitably 
localized region around some four dimensional space-time points. We express 
these region by V, and V,. y(x)y,y(x) expresses the change of the state of 
the electron, and the positive (negative) frequency in its Fourier transformation 
corresponds to the increase (decrease) of the energy of the electron. If we 
require correspondence-theoretically that the energy of the quantum carrying 
action is positive, such an increase (decrease) of the energy corresponds directly 
to the absorption (emission) of the quantum. 

Let us assume, for definiteness, that V, spreads around t=0, and, there, 
the energy of the electron decreases by hy,, namely, 

1? 

ot — a ; 


VY) Yu (y) = Only) exp Ti 


(5) M. Frerz: Helv. Phys. Acta, 23, 731 (1950). 
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Then, we can extend the time-integration in the region V, to the whole time 
interval, and we get 


foo} 


; 2 
(2.7) fa exp ot = fi Dra(æ, 4 ) = 
0 for Dr: : 
re | (t. Fr) 
tn OP int Fr) — Tè ; for Dit 


Therefore, the parts which give non-zero contribution to this integral are D}, 
and Di, mathematically. But, we must take the D;,-funetion as the D°-fune- 
tion in eq. (2.6), physically, if we understand the decrease of the energy at y as 
the emission of photons. Because we must consider, from the standpoint of cor- 
puscular image, that photons run from past to future. 

We can take such a corpuscular image of the photon, only if the region V, 
lies in the wave zone. What relation between #, and 7 is imposed by this 
image? The Fourier frequencies of exp[— ¢#/T?] extend around v= 0 with the 
width 2/7. So, it must be »>1/7, in order that we can say anything about the 
sign of the frequency of exp [int — (/T?)]. On the other hand, only the part 
of the region V,, which lies within the width 7 around t, a 7, is affected by 
the action from the region V,, as can be seen by eq. (2.7). Thus, we can see 
that the region V, is in the wave zone, if t, > 7. (Then, r >1/r,— À, be- 
cause 1) ~ ty) > Tv >1). This correspondence-theoretical view has no mean- 
ing, if the region V, is not in the wave zone. This corresponds just to the 
undefinetness of D(x, y) for x = y, in Stiickelberg’s interpretation. 


8. — The causal character of the vacuum-7-function. 


In this section, we show that the so-called vacuum- T-function has the 
causal character in the sense of Stückelberg. 

If we assume for the theory to be invariant under the inhomogeneous Lo- 
rentz transformation, there exists a displacement operator P,, and this ope- 
rator satisfies 


for the arbitrary operator O(x). We interpret this operator P, as the energy- 
momentum operator of the system, and assume that there exists the lowest 
eigen-value of P,. We define the eigenstate corresponding to this value as 
the vacuum, ®,, and normalize its eigenvalue to be zero. We consider again 
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scalar field A(x). Then we can write as 


(Do, A(x) Dr) = aq, exp [ike] ee Ob de 


[Al 


Li ke 


Now, let us consider the case of the one-particle problem (7). In this case, 
the vacuum-7-function is given by 


{Ti Ala) A(a'))>o = {Mom a) AT (@ — a!) — O(a, ay) A (a — a')}s 


A (x — a) = — 1 A(x) A(x')» = — À ay: | 0%) o(— k?) exp [tk(x — x')|d‘k, 


A (a — 2') =i< A(x) Ala’), = à ay et o(— k?) exp [— ik(x — x')|d*k , 


i jo i a <0, 
Et Ina 0", 


and 


o(— k?) = (2x)° x do Uo : 


The 0(k°) functions guarantee the positive frequency character of the A‘*’-func- 
iton, and the negative frequency character of the A~”-function. We can see at 
once that this vacuum-7-function bears Stückelberg’s causal character, if we 
take into account the factor 0(x,). ; 
We discuss here a little what is responsible for this causal character, before 
entering into the many body problem. The essential point to the above con- 
sideration is that %, is positive definite, and this property comes from the 
lower-boundedness of the spectrum of the energy operator. But, it is not 
correct to conclude that the existence of the vacuum warrants the causality. 
The above analysis means only that we can take the vacuum 7 function as 
the function which has the causal character when there exists the vacuum, 
and this function has not the causal character if P, has not the lowest eigen- 
value. | 
If the energy spectrum is not lower-bounded, we can still take some state 
as the standard state, and make the states with larger eigenvalue than this 
belong to the positive spectrum, and the states with smaller eigenvalue to the 
negative spectrum. We can construct à function referring to this standard 
state, similar to the vacuum-T-function. Here, we can separate <A(x) A(æ')» 


(7) H. LEHMANN: Nuovo Cimento, 11, 342 (1954). 
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into two parts 


CA (a) A&))= 5 (0| Ala) |mt><nt | A(w’)|0> + 
+ D <0| A(x) |n-><n-|A(a’)|0>. 


The first part (the positive spectrum part) of the right hand side of this ex- 
pression gives the causal contribution to the vacuum-7-function, but the con- 
tribution from the second part (the negative spectrum part) has anti-causal 
character. There are two possibilities to make the vacuum-7-function regain 
the causal character. Firstly, we may introduce such an operator that changes 
the sign of the energy belonging to the negative spectrum, without changing 
the sign of the energy in the positive spectrum. Secondly, we may reinterpret 
the states belonging to the negative spectrum. The positron theory is an 
example of the second possibility. 

We consider an F-function definded in the following, in order to analyse 
the many body problem (8) 


F(x, Las vey Cn) = (Do; A(x,) A (22) ET A(x,) Do) 


The inhomogeneous Lorentz transformation to the scalar field operator A(x) 
can be written as 


A(Ax + a) = U(a, A) A(æ)U(a, A). 


We suppose that the vacuum state vector ®, remains invariant under this 
transformation, that is 


U(a, A) Qi = Die. 
Then, 
F(X: Las cs Un) = FAR “+ a, Ax, + à, …, Ax, +a), 
according to the invariance property under the transformation. Thus the 
function # is the function of &; == 7;,—.2,,,. We denote the Fourier momen- 
tum concerning to this variable È, by p,, 


eer met) = 


=n, d'p,... dip, exp [— 4(F, Pa + ÉaDa it Ena Pua) dpr; Pis cs DR Te 


(8) A. S. WIGHTMAN: Phys. Rev., 101, 860 (1956). 
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In order to make clear the physical meaning of p,, let us analyse the expression 


fer [ipa] d*a FE, ..., En, ha, Eras bea). 


Ji we put &; == 0 in this expression, and then rewrite — a as &,, this becomes 
the Fourier transformation of F with respect to the variable € The above 
expression can be deformed as 


—|exp{ipal]d'a<®,, A(x,)…. A(x;) A(æ,:, — a)... A(x, — a) D, 


= l@ A(a,) Aa) fata exp [ipa] U(—a, 1) A(x;;1)... A(e,)®)) i 
Thus the above operation takes out the component with the energy-momentum 
p of the system in the state D) = A(a,,,)... A(x,)®,. We can see by this 
consideration that the Fourier momentum p; is the possible energy-momentum 
of the state A(x,1,)... A(x,)D). The energy momentum four-vector is time- 
like, if we take into account the fact that there is always the center of mass 
for any physical system. Further, if we assume that the energy spectrum is 
lower bounded, G@(p,... p,) has non-zero value only if pî >0 and p?> 0. 
It must also be noticed in the following consideration that the factor 
exp [— ip(x — y)| (p? > 0, p° > 0) means that the action is propagated by 
the quantum of positive energy from y to x, as in the case of the one particle 
problem. 
The vacuum 7 function in the many particle problems, 


Are ey ey ae D, LA) Ale) te 40, )) D) 
can be expressed by the #-function; 


Tdi, Lys Ln) =F (x, Ha 4 1003 Cn) Her on DISSE) A) 


TOR des US 


TX: ; Lg y ve) Oa) = (0; Lise Ma) = F(—é,, È, "e. Sala 


à 0 m0 10 
LOR GR ae 0e 


We can see that this t-function describes the propagation of the action by the 
quantum of positive energy from «, to #,, (due to the factor exp[— ipé,}), 
if a >a} >. >, and from «, to 2, (due to the factor exp[tpé,]), if 
a >a>...>«#. Thus the vacuum-7-function for the many particle system 
has causal character in the sense of Stückelberg. 


L | rh A 79 4 = "HT IT OUT re oe 
} SEEN +. , 5 A 
/ fi “ 4 
= 
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We shall be able to construct the propagation function that has causal 
character by the same technique as in the case of the one-particle system, if 
the energy spectrum has not the lowest eigenvalue. 


4. — Causal character of the S-matrix. 


It is most convenient to analyse the S-matrix by the method of LEHMANN, 
SYMANZIK and ZIMMERMANN (°), in order to make clear the causal character 
of the S-matrix and the assumptions involved in the scheme. Their theory 
formulates the kinematical part of the S-matrix (or field theory), starting from 
few and considerably definite assumptions. 

Their first assumption is that the field operator reduces to its asymptotic 
field at t= + co, (asymptotic condition). This asymptotic field may be the 
clothed wave packet, or may be in the configuration of a bound system (*). This 
asymptotic condition is formulated explicitly in the case of scalar field, and, 
there, the bound state is assumed not to appear. The Heisenberg field ope- 
rator A(x) can be expanded in terms of the complete set of the ortho-norma- 
lized positive frequency solution f (2) of the Klein-Gordon equation: 


(4.1) A(x) = Y{A°()f(e+A*()f (0, 


where 


A“ (a) = i | A (x) = f, (a) dx. 


Co=t 


The free (but clothed) fields A, (7) and A, (1) also can be expanded by the 


in 
same set. The expansion coefficients A}*, A% are time-independent, and can 


out out 


be interpreted as the annihilation-, and creation-operators of incoming (out- 


(°) H. LEHMANN, K. SYMANZIK and W. ZIMMERMANN: Nuovo Cimento, 1, 205 (1955). 

(*) When we take into account the bound state, we must construct the operators 
which create or annihilate the bound systems. It is really possible to construct such 
operators, and to let their scheme include bound systems. The operator 


ee) = fre. P)dipAp-p Apyy» 


can be considered as the creation operator of a bound system composed of the two 
particles in the state n. Here, P and p are the total and relative four-momentum 
and F,, is the ortho-normalized eigenfunction of the n-state. This operator satisfies 
the relation 


LB, (Ph Bs (P')] an Cam OLE ee . 


\ 
A 
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going) free particles. The asymptotic condition is expressed as 


(4.2) lim (D, A%(t)V) = D, AP). 


™T—> © 


out 


Here, ® and Ÿ are the arbitrary state vectors in the Heisenberg represen- 
tation. 

This condition corresponds to the integrability condition in the differential 
formulation of field theory. If we did not impose this condition, their scheme 
would include such a non-integrable theory as the theory of non-local inter- 
action. But, the non-local interaction is excluded from the theory, as can 
be seen by the perturbation-theoretical solution of their reduction formula (*). 
This is closely connected to the situation that the degree of freedom of the system 
increases if the non-local interaction is switched on. So the asymptotic condition 
seems not to be satisfied, when we are concerned with the non-local inter- 
action. (See Appendix I.) 

They assume, secondly, that the Hilbert space at t=-+ © is expanded 
by the free states. Namely, they assume the existence of the vacuum, which 
is defined by (+) 


(4.3) 40 = 0 


out out 


and construct the base of Hilbert space by means of Af, operators, starting 
from this Q,, . oe 


SIR: 
out 


They further require the condition, 
(4.4) [A(2), A(y)] = 0 for (@—y} > 0, 


as causality. This condition can be replaced by the requirement that the 
vacuum-T-function is Lorentz covariant, and is not used explicitly. 
We can put 


(4.5) Q 


(*) They assumed that the interaction is of tri-lmear form in the first approx- 
imation. In this case, we must use the method proposed by KATAYAMA (1°), in order 
to determine whether the solution is a non-local interaction or a local one. This situation 
is due to the non-linear character of the interaction. We can see at once that the pos- 
sible solution is local, in the case where the interaction is bi-linear in the first approx- 
imation (see Appendix I). In this case, the possible interactions have the forms, A(x)? 
and A(#)LJA(a) (or 0,4(x)-0, A(x)), which are local, and the non-local interaction such 
as (L1A(x))? is not included in the solution. 

(+) This definition is here equivalent to assume the vacuum as the lowest eigen- 
state of energy-operator. 

(0) Y. Karayama: Prog. Theor. Phys., 10, 31 (1953). 
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because the vacuum is a collision constant when the interaction between clothed 
particles is switched on at infinite past and switched off at infinite future adia- 
batically (*). Then, the S-matrix defined by 

(4.6) (Do, DM) = (BY, SOL) 


in 


can be expressed as 


co 


(4a) S = C+ Ky t( Sy 280 La): Aga (Gy) nce Ant): de dr 


Here 
K,;=Ui—p’', 


and :A,,(#)...A,,(@,): is the so-called normal product of field operators A,n(Li)- 
The graphical meaning of the 7-function can be seen, if we transform this 
function as following, 


(4.3) T(a a) =| Ar x Li — È). Ap (Xn — En) O(E,, +. Fate ENS 


and substitute this expression into the above S-matrix. Namely, the t-function 
gives the matrix element of the Feynman graph (including external lines) for 
the concerned physical process (*). 

Now, this t-function has a causal character, as was explained in Sect. 3. 
Thus, the S-matrix also has causal character in the sense of Stückelberg. 

The reason why their scheme gives the causal structure to the S-matrix can 
be divided into two steps. Firstly, the vacuum-7-funetion appears in the 
S-matrix. Secondly, the vacuum-7-funetion has causal character in their 
model. The essential points for the first step are the asymptotic condition and 
the fact that ®,,, and ©, belong to the Hilbert space, generated from 
Q,,= Qu At this step, it is not essential that ©, and D,,, are constructed 


in out 


by means of the configurations with positive frequency (*). The fact that we 


(*) Qu may be different from ©, by an arbitrary phase factor e”. We may put 
this f as zero, because f is never observed. 

(+) This t-function includes the contributions from the so-called disconnected graphs, 
and we must subtract such contributions (11). But, this situation is not essential for 
our discussion. 

(x) In fact, if we construct these vectors by means of the configurations of nega- 
tive frequency, their reduction formulas (accordingly the expression of the S-matrix too) 
have the same forms as the original ones, provided that we replace f,’s by TER 

(11) R. T. MarTHEWS and A. SALAM: Proc. Roy. Soc., A 221, 128 (1954); K. NI- 
SHIJIMA: Prog. Theor. Phys., 10, 549 (1953); 12, 279 (1954); 18, 305 (1955); Z. MAKI: 
Prog. Theor. Phys., 15, 237 (1956). 


— di fit 
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used only the positive frequency part is concerned to the second step. If there 
are the configurations of negative frequency, it means that there are the phy- 
sical states with negative energy, as easily seen from the discussion of the pre- 
ceding section. In such cases, we must modify the theory, as was expressed 
in Sect. 3, in order to make the vacuum-T-function regain the causal cha- 
racter. We want to emphasize here that the so-called micro-causality (eq. (4.4)) 
is not used explicitly, and the covariance requirement to the vacuum-T-funetion 
is enough to get the above results. 


5. — Causal ambiguity and renormalization constants. 


The A‘(x, y) function for x, = y, remains undefined by the correspondence- 
theoretical standpoint about causality, as emphasized in the previous section. 
If we use the same function in the case 4, =% as the one defined for x # Yo; 
the S-matrix calculated by this function is completely identical with the one 
which is obtained starting from the differential Schrôdinger equation, and 
includes divergent integrals generally. STÜCKELBERG and his coworker (?2) 
gave a provisional recipe to treat this divergent integrals. Their method re- 
duces the divergence problem to one of indefiniteness. Further, they showed 
that such an indefiniteness could be connected to the so-called mass- and charge- 
renormalizations (!*), and their analysis made way to the concept of the re- 
normalization group (14). 

Their recipe can be interpreted clearly by the example, in which a neutral 
scalar field g interacts with itself by the g-interaction, as was shown by them. 
The S-matrix element for the self-energy graph is given by 


1 ! 
SO) 3 [dee arue ole, PEW) - 
The Fourier expressions for the 4, and A-functions are 


Ay (a) = aap | exp [tka] 6(k? + wu?) 


and 


An) = Ex | d*k exp [tka] 


ke + pu?" 


E. C. G. STÜCKELBERG and D. Rivier: Helv. Phys. Acta, 23, 236 (1949). 
(18) E. C. G. SrÜcKkeLBERG and T. A. GREEN: Helv. Phys. Acta, 24, 153 (1951). 
E. C. G. STÜCcKELBERG and A. PETERMANN: Helv. Phys. Acta, 26, 499 (1953). 
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The part of the above matrix-element, 

ACCRA 
gives a divergent integral, and the contribution from this part is 


[_ôp+u) | o((k—p) + A) 
kp) + a? pre | 


a fatr 


Their recipe for this case is as follows. We first re-express this integral in the 
form in which k’s are included only in the combination of ki. Next, we make 
the power of p in the integrand decrease by differentiating this expression in 
terms of Kis and then carry out the p-integration. At last, we integrate the 
result with respect to k? from zero to k?, in order to make the contribution 
from the convergent part regain the original value. Explicitly calculating, 
we get in the first step, 


i L 
fato fi 6'[p? + u? + (u — u?)k?], 
0 


and, by differentiating this with respect to k?, 


L 


| du (u — [as p d"[p? + u? + (u — u?)k?] = man 
0 0 


u — u? 


pet (w— ue) ke 


Their recipe is completed by integrating this expression with respect to k? 
from zero to k?, after the w-integration. 

In the general case, we must differentiate the concerned, matrix element 
by %? n-times, corresponding to the order of the divergence, to get a convergent 
result of the originally diverging integral. Accordingly, we must integrate the 
result with respect to k? n-times at the last step, to cancel out the effect of 
the differentiation in the convergent part. By such procedures, there appears 
an indefinite expression, 


bo + bit +. + by Rem, 


which vanishes by the »-times differentiation by k?. If we reform this ex- 
pression in such a way that k? appears in the combination of the wave ope- 
rator (k?+u?), the coefficients of this expansion can be connected to the re- 
normalization constants of mass and charge. 


Now, we translate this recipe into the co-ordinate representation, in order 
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to see its relation to the discussions in Sect. 2. The differentiation by k? cor- 
responds to the multiplication by (a —y)?. So, the differentiation by the 
necessary times corresponds to make A‘(a, y) finite at 2° = y° by the multipli- 
cation of (x — y}?", corresponding to the order of the infinity. The k?-integ- 
ration at the last step is carried out in order to make the (x — y)?-dependence 
of the convergent part of the concerned integral reduce to the original one. 
By this procedure, the result coincides with the original value at 2° 4 y° (that 
is, at the region in which the integral has finite contribution). Thus, we see that 
the above indefiniteness corresponds just to the undefinedness of the A“(a, y) 
function at æ°— y°. 

It is interesting to refer to the consideration in the correspondence theory 
from the point of view of the corpuscular image in Sect. 2. The conditions 
t, > T and », >1/T, which are necessary to make the corpuscular image 
meaningful, show that 7 (accordingly #, too), needs not to be so large if v, is 
sufficiently large. This seems to be the natural thing from the standpoint in 
which we combine the divergence problem to the indefiniteness. 


6. — Causality in the dispersion relation. 


GOLDBERGER (1) turned his attention to the fact that the scattering ampli- 
tude #,,(k; À, 4') coincided with the non-free part of the Fourier transform 
M,p(k; 4', À) of the expression, 


a 


lim <pa' |[p,(@), aly] |PA> , 


C>+ 00 
Yo—— © 


in the domain k, > 0, apart from a trivial factor. Here, k is the four mo- 
mentum of the scattered meson, and À, 4’ are variables, other than the 
momenta p and p’, of the scatter. If we translate the causal character, in 
the form 


Lo, (2), p(y)] = 9 for (7 — 9)? 0. 


into an analytic property of M,,(k; À, 2) in the complex k-plane, we can 
use Cauchy’s theorem. In this way, we can get a relation—dispersion re- 
lation—between the real and imaginary parts of the scattering amplitude. 

The above causal property really expresses that the observations at the 
two space-time points separated spacially from each other do not interfere 
mutually. But, the property, which is used in the practical analysis, is 
only that M,,(k; 2,2") is analytic and bounded in the upper-half of the 
complex k-plane. 


(5) M. L. GoLDBERGER: Phys. Rev., 97, 508 (1955); 99, 979 (1955). 
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We want to emphasize that the causality is used also in another form, 
in the derivation of the dispersion relation. Namely, the causal character 
that is discussed in the preceding sections is already included in the structure 
of P_;(k; À, a’). 

If we neglect this situation, even curious results may appear. These anal- 
ytic properties may be possessed by rather general models, provided that we 
do not take into account the fact that the analytic properties come from the 
commutation relation and use only these properties. For example, even in 
the theory of non-local interaction (*), in which we cannot construct the scat- 
tering amplitude F,, consistently, we can derive the dispersion relation, if 
we give F,; suitably. Lee’s model is an example of the non-causal model, but 
the scattering amplitude (16) has the above analytic property, and we may 
get the dispersion relation. 

The domain of the theory, in which the analytic property necessary to 
derive the dispersion relation is fulfilled, seems to be wider than the one of 
causal theory, as we can see from the above discussion. But, the theory that 
might be constructed by starting from the dispersion relation would not include 
a wider theory than the present field theory, as far as we require the causality 
to the theory. Because, the properties of the scattering amplitude, which are 
imposed by the causal requirement, restrict the possible scheme of the theory 
rather severely. The utility of the dispersion relation seems only to be a con- 
venient relation which connects the observable quantities or the scattering 
amplitudes of different physical processes to each other. In fact, the disper- 
sion relation can be derived without referring to the details of the concerned 
theory, but we must go back to the Lagrangian formalism or some theory 
equivalent to it which guarantees the consistency between equations of motion, 
quantization and Schrédinger equation, etc., in order to decide whether the 
relation is meaningful or not (+). 


7. — Concluding discussion. 


The structure of field theory can be understood clearly, if we accept the 
causality in the sense of Stiickelberg, as discussed in the precedings sections. 
Namely, the fundamental assumptions in our field theory are Lorentz cova- 
riance, unitarity of the S-matrix, and causality. 


(*) In this case, we cannot introduce the M-funetion in such a way as in the case 
of causal theory. But, we can define M x8(k; 4', 4) as the analytically continued 
function of F,,(k;2',4) to the complex k, value. If this M-funetion has the con- 
cerned analytic property, we can derive che dispersion relation. 

(°) T. OKABAYASHI and S. Saro: Prog. Theor. Phys., 17, 30 (1957). 


(+) Especially, the form of the Born term in the dispersion relation must be browse 
from the conventional theory. 
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At this point we want to call attention to the fact that unitarity and 
causality are independent to each other. All the non-integrable theory are 
non-causal in the examples, which are proposed up to now (17). But, we can 
see this independence from the discussion in the last part of Sect. 5. The 
perturbation method to construct the S-matrix proposed by STÜCKELBERG (°) 
also explains this situation. They construct the hermitian part of the n-th-order 
term of the S-matrix by means of the lower order terms, using the unitarity re- 
lation, and make up for its antihermitian part in such a way that the propa- 
gators included in the hermitian part regain the causal character in their sense. 

The interpretation of causality by Stückelberg make clear not only the 
structure of field theory, but also the conditions which are imposed on the 
physical system by causality. Further, it throws light on the relation 
between the limit of the correspondence theory and the difficulties in field 
theory. We can say nothing whether a further development of field theory 
is obtainable without modifying the above three assumptions. It might be 
fruitful to give up the causal requirement at least locally. But, it is very 
interesting that there remains in the present field theory a part which can 
be treated only by the mathematical recipe, and which is undetermined physi- 
cally even by these three assumptions. We succeed temporarily in the treat- 
ment of divergent integrals by the renormalization procedure, but we can see 
by the preceding discussions that this procedure lies out of the frame of the 
present field theory. On the other hand, this procedure gives satisfactory 
results at least in quantum electrodynamics. So, we must extend the 
theory in such a way that the new theory involves the physical contents of 
this protruded part. 

A tentative standpoint in this direction is to improve the fundamental 
concepts in the quantum field theory, in such a way that we complement the 
unsatisfactory feature in the causal correspondence, the indefiniteness of the 
A4°(x, y)-function at x°= y°. The required element can no longer be classical, 
but seems to be some quantum theoretical concept. 

We construct the present-day quantum field theory on the base of the 
stable particle image. Our discussions in Sect. 2 and 3 are based on this image. 
The quantization based on the unstable particle image may be considered in 
stead of the stable one, in order to extend the theory. Then the interaction 
vertex will behave effectively as a non-local one, though the field operators and 
their interactions are local. Thus, we may expect that this character makes 
up the unsatisfactory feature of the physical image at x° = y°, to some extent. 


(7) For example, E. C. G. STÜCKELBERG and G. WANDERS: Helv. Phys. Acta, 27, 
667 (1954); M. E. EBEL: Dan. Mat. Fys. Medd., 29, No. 2 (1954); W. HEISENBERG: 
Zeits. f. Naturf., 5a, 251, 367 (1950); 6a, 281 (1951); M. Frerz: Helv. Phys. Acta, 23, 
731 (1950). 
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One may oppose to this intention by the reason that electron and photon con- 
cerning our successful quantum electrodynamics are absolutely stable. But, 
the stable limit in the theory, which is formulated on the base of the unstable 
particle image, may be different from the one in the conventional theory, with 
respect to such delicate problems. Further, when we are concerned with the 
unstable particle, the two formalisms may give considerably different results, 
quantitatively. 

We previously investigated the renormalization problem of unstable par- 
ticles (1°), in order to get any clue to improve the theory in this direction. 
We took up this problem on the expectation that the conventional renor- 
malization method would fail, because it was based on the propagation cha- 
racter of stable particles. But, we could find the consistent renormalization 
method by a little modification (18), unfortunately or fortunately. We cannot 
yet find any new concrete clue by studying this problem. The most primitive 
method might be to expand the field operator by the complete set of the 
decaying waves, for example exp [i£t — y(£)t] exp[—ikx], instead of plane 
waves, and to quantize the expansion coefficients. Such a simple modification 
would be unsatisfactory and we should have to make any statistical consi- 
deration about the ensemble constituted by the unstable particles and their 
decay products. In such a treatment, we must give a life to the concerned 
particle from the beginning, and this may be a rather unsatisfactory feature (*). 

The other way to improve the theory under the three assumptions men- 
tioned at the beginning of this section will be to utilize the negative energy states 
suitably. But, the physical interpretation and the consistent formulation of 
the theory involving these states are very difficult (1°). 

At the end of this section, we give the relation between Stiickelberg’s 
causality and the so-called micro-causality. The former contains the latter, 
because the S-matrix based on Stiickelberg’s causality (') coincides with the 
one obtained from the conventional differential form of field theory, apart 
from the causal ambiguity. It is difficult to combine these two concepts 


(8) This problem is treated by a slightly different method by H. ARAKI, Y. Mu- 
NAKATA, M. KAwaGUCHI and T. Goro: Prog. Theor. Phys., 17, 419 (1957); V. GLASER 
and G. KALLÉN: Nuclear Physics, 2, 706 (1956). 

(*) In the conventional theory, the life is given as a funetion of the coupling 
constant of the concerned interaction. To the contrary, the coupling constants may 
be given as a secondary quantity determined by the life. The decay constant y can 
be interpreted as the imaginary part of the complex mass. Then, some of the coupling 
constants are absorbed in the complex mass. If we can construct the theory in such 
a way, it is not unsatisfactory to give the life at the beginning. 

() P. A. M. Dirac: Proc. Roy. Soc., 180, 1 (1942); W. PAULI: Rev. Mod. Phys., 
15, 175 (1943); W. HEISENBERG: Nachr. Gottingen Nat. Wiss., Nr. 8 (1953); Zeits. ie 
Naturj., 9a, 272 (1954); 10a, 425 (1955); 12a, 177 (1957); Zeits. f. Phys., 144, 1 (1956) 
Nuclear Phys., 4, 532 (1957). 
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directly, because Stückelberg’s causality is expressed in terms of the field 
quantities at the different times, and the micro-causality is formulated at one 


instant. We can give the corresondence only in the formal limiting process, 
as following: 


<O|][A(2), A(w')]]0>, x = <0|{ lim T(A(x)A(x')) — lim T(A(c)A(2'))}|0) 


Lo > + Ly—>By — E 


= lim AE, ni) lim ZAN (æ, x) 


Lo—> Lo + E Lo PX 7 € 
= Je — ( 7. 
ee Ag air t) dim ALI, t) 


Rm; t) — Jim Ay (r, —t) 


t->+0 


}— MOL 


APPENDIX I 


Non-local interaction (*). 


The basic defect of non-local interactions is that we have no relativistic 
method to formulate th>m consistently. If we use the relativistically in- 
variant non-local interactions, the theory becomes non-integrable. Because 
such interactions involve the form functions extending also in the time-like 
direction. In this appendix we start from the investigation about the system 
described by the Lagrangian which contains the higher order derivatives of 
the dynamical variable, in order to understand this situation as intuitively as 
possible. 

The non-local interaction can be considered generally as such a limiting 
case that the order of the derivatives of field variables contained in the inter- 
action becomes infinitely large. For instance, the form function F in the inter- 
action between a spinor field y(x) and a pseudoscalar field g(a), 


frere, xe", 0") p(x" ys p(x") da’ da” dx” , 


can be expressed as 
5 1 f oof x" Z 
fh a le = ep [UE l) exp Gal Sead | 


() Q I III. 
= a api? tase O(a’ — x") O(a’ — x"). 


exp [il(a’ — w”)|dLdl= 


(*) The contents of this section were a part of the article published in the Soryu- 
shiron-kenkyu (mimeographed circular in Japanese), 5, 49 (1953). 
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Here, the invariance of F(x', x", x") under the inhomogeneous Lorentz trans- 
formations is taken into accont: There are different properties between 
the case, where we are concerned with the derivatives of finite order, and the 
case of the derivatives of infinite order, as will be seen later. But, these dif- 
ferences seem to have nothing to do with the discussion about the integrability. 

For simplicity, we begin the discussion with the particle dynamics in which 
the dynamical variable is q. The transformation function from the time t, 


to ty is given by 
=|.-fexe [iS] A pat) , 


7 | .  dg() 
S = [tt q) at , and Per di ’ 


here 


when the Lagrangian of the system is described by L(q, 9) (*). If we divide 
the time interval ty — t, into small divisions whose intervals are e, and ex- 
press the velocity at each divided time as g, — lim (isa — qi)Je, we can reform 
the integral in S into the sum, 


dq. d d 
D = fi DONS Be à 


where À is a normalization constant. We can split the sum into two parts 

«in such a way that the one contains only variables qg; (è > k) and the other 
N-1 k-1 

contains q; (j <k), that is, into Y L(4:+1, 4) and Y L(4;+1: 4;). By this split- 
i=k 


j=1 
ting we have the composition law, 


« dq. 
x.) = fre a 1) 


Namely, we can solve the problem, by giving an initial value q, at the time #,, 
in the two steps, first from t, to t, and then from t, to ty. Here we can take 
arbitrary times as ¢,, and this situation corresponds to the integrability in 
field theory. 
When the Lagrangian contains ÿ, we must treat L(qi dis di1) instead of 
L(qis1, 4), contrarily to the ordinary dynamics (because 4, = lim i (dia — di)/e). 
k—1 


Then the expression S becames DETTE dis Qi) +2 Lai) dir di); by the 


same procedure as used above. The first term conta qi (i>k—1), and the 
second term contains q; (j <k). Thus, the composition lave fails if we take 
only q as the dynawical variable, on account of the situation that the time 


(2°) R. P. Feynman: Rev. Mod. Phys., 20, 367 (1948). 
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division ¢,;<t<t, concerns with both terms. We must treat not only q 
as the dynamical variable, but also g, in order to get the composition law 
again (*). 

Next, we consider the system, whose Lagrangian is composed of the two 
parts, 


Dre L(g, q) + Li(q, q; q) 9 


4 is the complete dynamical variable for the part Z,, but we must take into 
account ÿ also as dynamical variable to treat the whole Lagrangian. Thus, 
we cannot consider the term L; as a perturbation. 

The higher the order of the derivatives involved in the Lagrangian, the 
broader the overlapping between K(N,k) and K(k,1). This overlapping 
becomes the whole time interval in the limit of infinite order. This case 
corresponds to the non-local interaction, with respect to this situation. 

The case, where the highest order of the concerned derivatives is finite, 
is different from the case of the infinite order derivative, in another respect. 
For definiteness, we take the system described by the Lagrangian 


d 
Dar (je. 
where F is supposed to be 
d dice Ne 
ra) ee GC + cece): 


The second factor can be rewritten as an ensemble of N harmonic oscil- 
lators (22), and the first factor cannot be connected to the harmonic oscillator 
model. If the highest order of the derivatives is finite, we can always factorize 
F as in the second factor. But, there appears such a factor that we cannot 
reduce to the harmonic oscillator as the first factor, when we concern with. 
the Lagrangian involving derivatives of infinite order. In field theory, we 
quantize the Fourier coefficient of field quantity, on the base of the corpuscular 
image, when the coefficient constitutes a harmonic oscillator. Thus, the part, 
corresponding to the first factor in the above example, does not increase the 
degree of freedom of the field in this sense. But, the situation concerning with 
the composition law is not different from the preceding discussion. This can 
be seen also from the analysis of Pais-Uhlenbeck. 

PAULI (4) classified form factors into two classes, normal class and 
abnormal class. If the manifold of the solutions of the system with the non- 


(*) We have the method of Ostrogradsky (21) to treat both q and q as dynamical 
variables. This method is modified by Pais and Uhlenbeck (2?) so as to adapt it to 
the harmonic oscillator model. If we extend Ostrogradsky’s method. to field theory, 
there appears a new difficulty in the case of non-local interactions, as will be shown 
in Appendix Il. 

(21) E. T. WHITTAKER: Analytical Dynamics (London, 1937), fourth edition, p. 265. 

(22) A. Pars and G. E. UHLENBECK: Phys. Rev., 79, 145 (1950). 
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local interaction is the same as the one of the free system, the form factor 
contained in the interaction is defined to belong to the normal class. The 
form factor which does not have such a property belongs to the abnormal 
class. According to this definition, the interaction with the form factor be- 
longing to the normal class does not increase the degree of freedom of the field, 
and has such a property as the first factor in the above example. In soluble 
simple problems, we can find the form function of the normal class, actually. 
For instance, in the equation of motion 


q(t) + q(t) = fo —t) q(t) dt", 


such form functions as 


1 
Dt). = — mu (a> 0) 
and 
11/7 . 
P(t) = Gela) (a> 0) 


belong to the normal class. But, the problem of the integrability of the theory 
is not solved only by the existence of form factors of this class. 

In the case of non-local interaction, we cannot pick up higher derivatives 
as dynamical variables, because higher derivatives do not appear explicitly. 
On the other hand, we cannot specify the state at the sharply defined time in 
this case. Thus we must pursue the S-matrix connecting the asymptotic form 
in the infinite past to the one in the infinite future, without following the sharp 
times step by step (*). The form functions of the abnormal class must be 
excluded also in this method, because the S-matrix ought to be unitary. Itis 
not proved that we cannot construct the unitary S-matrix in the case of the 
form function belonging to the normal class. But we cannot find such a 
method yet, and it seems to be impossible, as will be anticipated from the 
preceding discussions. We cannot determine whether the concerned form 
factor belongs to the normal class or to the abnormal one, until the solutions 
are obtained explicitly. So, it seems to be hopeless to get such an inclusive 
formulation that this difference is implied from the beginning, in place of 
Lagrangian formalism (*). ; 

Here we explained the situation about the integrability by the simple par- 
ticle dynamics. The situation in field theory is clearly seen by these discus- 
sions, already. The scalar field, interacting with itself through bilinear inter- 


(*) In the theory of non-local interactions, the conservation law can be obtained 
only in the integrated form. On the other hand, in the theory involving higher deri- 
| vatives of field, we can derive the conservation law of differential form. (See Appendix II). 

(+) In the local theory, the consistency between wave equations, commutation 
relations, Schrodinger equation and conservation laws ete., is automatically guaranteed 


in the Lagrangian formalism (?). It will be necessary to construct any formulation 
that plays such a role. 
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action, corresponds to the particle dynamics, as it is. For the case of spinor 
field, and of multi-linear (except for bilinear) interaction, we must investigate 
in detail whether the concerned interaction is local or non-local (*). For 
instance, we can see at once from our discussion that the interaction of the 
form, (LIA(«))*, belongs to the interaction with higher derivative, as already 
mentioned in the footnote of Sect. 4 We must bear in mind that the form 
functions of normal class also are excluded in the scheme of Lehmann-Symanzik- 
Zimmermann (°) discussed in Sect. 4. It is due to the fact that we imposed 
the causality on the vacuum T-functions implicitly as will be seen from the 
discussions in the text. The form functions of abnormal class are excluded 
owing to the asymptotic condition. 

It seems to be impossible to construct any satisfactory formulation for 
non-local interactions. If experiments (2?) suggest some non-local interaction, 
we may interpret this character as a secondary effect through some inter- 
mediate field. (Coulomb force and nuclear force have such character). If we 
aim at avoiding divergences by non-local interactions, we may consider such 
a possibility as following. The order of divergence in the self-energy of electron 
was diminished by the appearance of the new degrees of freedom, o- and 
o-matrix, and by using the hole theory. Now, we have new degrees of freedom 
concerning the iso-space, which are introduced in the phenomena of the 
so-called strange particles. If these degrees of freedom are connected with 
the Minkowski space in any sense, the situation concerning divergences will 
be altered (*) (24). The other possibility might be to construct a field theory on 
the modified particle image, as mentioned in Sect. 7. 


(23) For example, R. HorsTADTER: Rev. Mod. Phys., 28, 214 (1956); D. R. YENNIE, 
M. M. Levy and D. G. RAVENHALL: Rev. Mod. Phys., 29, 144 (1957). 

(*) In this possibility, it is difficult to introduce the so-called universal length (25). 
The aim to introduce the universal length was not merely to formulate cut-off (or to 
take into account the classical extension of particles) in field theory, but it was expected 
for new duality to appear at the small dimension. Non-local interaction also is only 
a temporal technique, in this respect. 

(24) T. OKABAYASHI: Prog. Theor. Phys., 10, 499 (1953). The situations about 
the strange particles were not clear at that time. Recently, have appeared new expe- 
rimental facts that seem to support the existence of the correlation between the 
Minkowski space and iso-space. In the strong interaction, the hyperonic charge, 
the isotopic spin, and the parity are conserved. In the weak interaction, the first two 
quantities are not conserved, and the parity seems not to be conserved in almost all 
weak interactions. 

Cc. S. Wu, E. AmBLER, R. W. Haywarp, D. D. Hoppes, and R. P. Hupson: 
Phys. Rev., 105, 1413 (1957); R. L. GARWIN, L. M. LEDERMAN and M. WEINRICH: 
Phys. Rev., 105, 1415 (1957); F. EISLER, R. PLANO, A. PRODELL, N. Samios, M. 
SCHWARTZ, J. STEINBERGER, P. Bassi, V. BoreLLI, G. PupPI, H. TANAKA, P. Wato- 
scHEK, V. ZoBoLi, M. Conversi, P. FRANZINI, I. MANNELLI, R. SANTANGELO, V. 
SILVESTRINI, D. A. GLASER, C. Graves and M. L. Peri: Phys. Rev., 108, 1353 
(1958). 

An analysis in which these situations are taken into account, is reported in our 
recent article published in Prog. Theor. Phys. 20. (1958). 

(25) W. HEISENBERG: Ann. der Phys., 32, 533 (1936). 
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APP ND El 


A Note on the Lagrangian including higher derivatives (*). 


We must take account not only of qg, but also of its higher derivatives 
as dynamical variables in order to keep the composition law, when the Lag- 
rangian of the system involves the higher derivatives of gq. OSTROGRADSKY 
proposed. a method to this problem in particle dynamics. In this appendix, 
we extend his method to field theory. Then, there appears a new contribution 
to the angular momentum, as will be expected. We restrict ourselves to the 
variation principle in classical field theory. The transition to the quantum 
theory is straightforward. Further, we suppose that there is only one scalar 
field g(x), for the extension to more general cases, where many fields with 
components interact with each other, is trivial. 

We consider the system described by the Lagrangian, 


op Gig ong 
8 = 2( 9. oo sree me): 
We denote the virtual variation of g by 5*g, and the variation of various 
quantities due to this variation by adding 3* in front of them. The notation à 
means the variation that comes from the infinitesimal transformation of space- 
time co-ordinates and field quantity. The virtual variation is commutative 
with the operation 0/0x", 


« ME NS, 


BE, a * 
à da” Preci di de 


on the contrary to 6. The total variation of g consists of the virtual va- 
riation $*g and the variation, which comes from the infinitesimal transformation 
of the co-ordinates, 


op 
A.II.1 p = d*p(a) + È di 
( ) ò d*9(2) San DE 


The field equation can be obtained from the stationary property of the action 
integral to the virtual variation, which vanishes at the boundary X of the 
concerned four dimensional domain X. 


pr —s*(e ax fre arno. | 
Lx 


(*) The contents of this appendix were published in the Soryushiron-kenkyu (mimeo- 
graphed cireular in Japanese), 4, No. 9, 37 (1952). 
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Here S*£ can be proved, by the induction method, to be 


‘ À lo] 
where 
n om CL 
ANSE {Eres 1)" RARES 
) [ Ip ci ) dx” dd 0(0™ p | da" om") 4 
n-1 GY, 
A.II.4 TS SR TS 
(ATLA) De ier 
‘ n—t-1 0 de 
A TI.5 Te t+1, 
( ) <p c( Ss - t) da”. . a CETTE Da. st 
and 
ese HO Op 
(A.IL.6) c(m,t)= Y ay(")/ |: 
v=0 


If we make the second term of è*£ vanish by the partial integration, we get 
the equation of motion, 


(A.II.7) | [2], = 0 


In order to obtain conservation laws, we impose that the action integral 
is stationary under the infinitesimal variation of x and g which includes the 
variation of the boundary, 


dI —frrear+ = Sa" dX, =| {are + so (£ dan) ATI 
x x x 
Then, we get 


BYE + — (2 Pal) = 0. 


= È 
This equation can be rewritten in the form 


9 
(A.IL.8) [Llp D = 5 U, 


where 


(A.IL.9) QE = — TW" — 2 ja = — > MES 


béni. 
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and 
n—1 ; € ot@ 
pi = lee fe Li ; 
(A.II.10) Ga => n° 50. Ed 


Here, eq. (A.II.1), (A.II.2), (A.IT.4) are used. The right hand side of eq. (A.IL.8) 
vanishes, owing to the equation of motion (A.II.7), 


A.IT.11 PME 0 
( tec ) dal A 


This equation expresses the conservation of WU”. 
The conservation law of energy-momentum is obtained by the infinitesimal 
translation of co-ordinates, 


da” = €” = const, 


(a) 0, 0 <t En À 
In this case, we have 
| AU" = TE, da, 
and (A.II.11) becomes 
resp 
0. 


TC", is the energy-momentum density. 
4 The infinitesimal homogeneous Lorentz transformation gives rise to the 
conservation of angular momentum. The concerned variation is 


deo (E, = — Sy) 
dp =0, 
op o.0 OP 
ra) = et ore (#0). 


fos LAO 
Then UV" is given by 
Ur = (a CT — DI) 
vo 2 
= d¢6 (Tan + Diva) 


2 
ETS QT" 
= #€ vo ? 


where (*) 
n-1 ot 
Dy miser È 
2, ni Ou? da 
(*) Av) = 4 yy — Ayì . 
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The equation (A.IT.8) becomes 


OI su 


Ox° 


oy 
== (Lia Dur fn = 0 . 


MO,» is the angular momentum density, and its first term is the orbital angular 
momentum density. If the Lagrangian does not involve the higher derivatives 
of y, there is not the second term of 977%,,. The reason why this term appears 
will be seen intuitively as follows. In the case that the Lagrangian contains 
the higher derivatives of g, (here the highest order is 2n), we need also higher 
derivatives of y, in order to specify the motion. In fact, the equation (A.II.9) 
shows that we need ©'p/üæx"…. (0 <r <n—1) (+), in order to express the action 
integral by means of the quantities on the initial and final space-like surfaces, 
as will be expected from the analogy with the conventional theory (?). Now, 
these quantities are tensors of higher rank, though @ is scalar. The second 
term of 977°,, shows that these tensors give contributions to spin angular mo- 
mentum density independently, corresponding to each rank. Such a situation 
will give raise to some difficulty in the theory of non-local interaction, because 
the clothed particle will have the spin different from the one of the bare par- 
ticle. 

We did not take into account the corpuscular image based on the har- 
monic oscillator model, in the above method. If we want to stand on this 
model we must use the method of Pais-Uhlenbeck (??). 


(+) Strictly speaking, these are to be 0"p/0t, for we cannot give the value of y 
and ©@p/ox' (i= 1, 2, 3) independently. 
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1. — Introduction. 


In treating problems involving the scattering of incident radiation (e.g., sound 
waves, radio waves) by a random medium, it has long been recognized that 
the random nature of the scattering medium plays an important part in atte- 
nuating multiple scattering. A familiar qualitative explanation of this pheno- 
menon is that the internal field produced in the medium by single scattering 
is considerably weakened by the «incoherence » or, to put it differently, that 
the contributions to the internal field from uncorrelated parts of the medium 
add like power rather than like amplitude. However, with the exception of 
the work of MINTZER ('), there appears to be little in the way of explicit cal- 
culations of the strength of multiple scattering in a random medium. The 
difficulty seems to stem mainly from the three-dimensional nature of the usual 


(*) The research reported in this paper has been sponsored by the Electronics 
Research Directorate of the Air Force Cambridge Research Center, Air Research and 
Development Command, under Contract no. AF 19(604)3495. 

(*) D. Mintzer: Journ. Acoust. Soc. Am., 25, 1107 (1953). 
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Scattering problems, which requires one to carry out difficult multiple in- 
tegrations in order to estimate the high order scattering. 

In the present paper, we treat the multiple scattering of incident plane 
waves by a random stack of dielectric slabs. Because of the one-dimensional 
nature of the problem, and because of the simple way in which the random 
stack is generated, we are able to carry the analysis of the effects of multiple 
scattering quite far. In particular, we find that we ean get an upper bound 
for the strength of scattering of any order which, unlike the crude estimates 
usually given, takes into account the random nature of the scattering medium. 
Our basic result is that when the number of slabs is large, the Neumann series 
for the mean square transmission and reflection coefficients converge much 
more rapidly than would be inferred from the smallness of the perturbation 
of the incident field by the scattering medium. This additional attenuation 
of multiple scattering is directly attributable to the random nature of the 
scattering medium. However, it is found that the randomness of the scattering 
medium is no ionger effective in reducing multiple scattering when the order 
of the scattering exceeds the number of slabs in the stack. 

Despite the evident difficulty of extending the analysis to three-dimensional 
scattering problems and more complicated scattering media, there is every 
reason to believe that our qualitative conclusions are quite generally valid. 


2. — Formulation of the problem. 


Consider a stack of N dielectric slabs, all of the same thickness d. Choose 
the origin of a Cartesian system in the lower face of the bottom slab, and let 
the z-axis point upwards in the direction 


normal to the slabs (*). The configuration Sereno 
is illustrated in Fig. L: 7 È EGO "; 
The random function «(z) which de- 


1 
( 
1 


seribes the dielectric constant profile of 


z 


the stack of N slabs is constructed as i o È 
follows. Let f be a Bernoulli random vari- Psi! 
able, which assumes the values —1 and STILI : È 


+1 with equal probability, 7.e., with pro- 
bability 4. To obtain a realization (sample 
function) of the random function e(z) we Fig. 1. — The scattering medium 
perform a sequence of N independent trials, (schematic). 

each consisting of an observation of f, and 

we denote the outcomes of these trials by B(1, x), B(2, @),..., B(N, a). (The 
index « ranges over a sample space S consisting of 2” points, each of which 


x 


(*) The slabs are regarded as infinite in the x and y directions. 
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corresponds to a possible outcome of the sequence of N trials). Then the 
realization indexed by x of the random function «(z) is given by the piecewise 
constant function (*) 


al ’ 2; 
(1) ee, «) = 1 + Ae(z, ~) =} 1+ Br, a)Ac, (r —1)d <e<rd,1<r<N,Ae>0, 
ni i <> Nd. 


For example, one might toss a well-balanced coin and assign one of the slabs 
the dielectric constant 1+Ae if the outcome is a head, and 1 — Ae if the out- 
come is a tail, then toss the coin again and assign any other slab the dielectric 
constant 1+ Ae if the outcome is a head and 1 — Ae if the outcome is a tail, 
and so on until all N slabs have been assigned values of the dielectric constant. 
We shall often indulge in the usual abuse of notation, whereby we suppress 
the index « and use the same symbol to denote both a random variable and 
the values it assumes, or to denote a random function and its realizations. 
Suppose now that a scalar plane wave exp [ik] is incident from below on 
a random stack of slabs generated in the manner just described, and that the 
resulting field (2) obeys the inhomogeneous wave equation (+) 
(2) 


he + k?| u(e) = — k? Ae(z)u(z) « 


dz 


Since Ac(z) is a random function, the same is true of «(2); the index « has been 
suppressed, so that (2) can be regarded either as a stochastic differential equa- 
tion, or as the wave equation obeyed separately by each realization of (e). 
The integral equation corresponding to (2) is 

Nd 
(3) u(2) = exp [ike] — [ate 2’) Ag(z")u(2") dz’, 


0 
where 


1 | 
Œ(e, 2!) = dik XP [ik|2—e'|], 


is the one-dimensional free-space Green’s function, and the incident wave 
exp [tkz] is, of course, a solution of the homogeneous wave equation. In writing 
the limits of integration, we have used the fact that by construction the random 


(*) To avoid confusion, note that A is used exclusively to designate an increment 
of the quantity it precedes, and is never used as an independent symbol. 

(+) It is also assumed that u(z) and du(2)/de are continuous. For simplicity we 
have chosen the dielectric constant of the medium in which the stack is immersed 
to be unity, with the result that e(z, «) can be less than unity. Those who are dis- 


turbed by this can replace e(z, «) in (1) by e(z, a)/eg, Where & > 1 and «(1 — Ae) >1; 
equation (2) remains the same. 
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function Ae(2), which is the dielectric constant profile centered about its mean, 
vanishes outside of the interval 0 << Nd. The formal solution of (3) is 
given by the Neumann series 


Nd 
u(z) = exp [ike] + = exp [tk |z—2,| | exp [tkz,]Ae(z,) de, — 


0 
Nd Nd 


al fexp [ik |e—z, | |exp[ik|z,—2,| ] exp [ike] Ae(z,) Ae(2,)dz, de, + ... 
(4) 0 0 


=1+ Sia) | È fe Likle—a|lexplikle—#|]..explitle, 2,1]: 


‘exp [ike,]Ae(2,) Ae(z) … Ae(z,) de, de, … de, , 


where, for the moment, we leave aside the question of convergence. 

It is particularly convenient in one-dimensional problems of this kind to 
describe the solution u(z) in terms of a transmission coefficient 7 and a re- 
flection coefficient R, defined by the relations 


u(2) = T exp [tke], e> NA, 
(5) 
u(z) = exp [tke] + R exp [— tke], 2 10). 
It follows from (4) and (5) that 7 and R are given by 


Nd Nd 


Na 
; 2 
LES EE - Ae(e,) de, — 57] fee [— ike,] exp [ik|2, — |] exp [tke,]- 
9 0 0 


-Ae(z,)Ae(2.) dz, des + … 


(6) Nd Nd Nd 
© ike n / n = re 
=1-+ > (5) | fer [—ike,] exp[ik|z, —2,|]...exp[tk|z,.—2,| |- 
n=1\ 
ae -exp [ikz, |Ae(z,) Ac(2,) ... Ae(z,) desde, ... den, 
and 


Nad Nd Nd 


; 2 
hi = exp [2ikz,]Ae(z,) de, È | | exp [tkz, exp[tk |z,—2, | |exp [ike,]- 
0 0 


È -Ae(2,)Ag(z,) de de, +... 
(7) NA Nd Na 
a 


= > (CT [fes Likelexplik|e—2||]..explik|2, —2,]]explike,]: 
n=1 
a, -Ne(2,) Ae(2,) .… Ae(2,) dz, des ... deg, 
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respectively. We remind the reader that T and R are random variables, à fact 
which may be emphasized by writing T(x) and R(«), i.e., by explicitly indi- 
cating the realization index x. Since the random function Ae(z) is constant 
in each slab, the multiple integrals appearing in (6) and (7) reduce to sums. 
Thus, using (1), we can write (6) as 


a hen 3 (ey À Say Pay ny Pu) BU» &)B Urey @) Blu, à), 


av Ty =1 


where the coefficients a(7,,72,..-,%n) are defined by 


(9) cal. ri. Tn) = if | 2. [en [— ike,] exp [ik|z,—2|]... 
(i—1)d (r3—1)4 (rn bd 


“exp [ik|2, — 2n-1| | exp [ike,] de, de, … den. 


The summation with respect to the indices r,, 73, ..., 7, extends over all the 
N” distinct n-tuples (r,, 72, ..., 7n). Similarly, we can write (7) as 


(10) R(@) = SP SW tay is r)Arr» IB Cay à) oe Blas 2) 5 


where the coefficients d(r,, 73, ..., 7,) are defined by 


rd rol l'nd 
(11) bri 153.004 rs) = if | fox [ik2,] exp [ik|2, — |] 
(ri bd (3-1 d (fn DA 
“exp [ik|en-1 — 2n| | exp [tkz,]dz, de, ... de, . 


It is important to note that 
(12) |AF13 Tay ces Mn) | Sd, [b(ri 5 Tay ce Ta) |< A", - 


which follows at once from the fact that the integrands in (9) and (11) are 
complex quantities with unit modulus. 


Turning to the question of the convergence of the series (8) and (10), we 
note the inequalities 


MES CENTS 100152). fr, a =1+ 5 (ET), 


Rois SMS) rsa) Bre SI 
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where we have used (12). It follows at once that the Neumann series (8) and 
(10) are absolutely convergent if 


(15) A= 


It should be noted that the convergence condition (13) guarantees the con- 
vergence of the Neumann series for 7(x) and R(«) for all « in S. 

In the remainder of this paper we shall be concerned with the mean square 
values of T(x) and R(«), since these are the quantities of greatest practical 
interest. Therefore, we now convert the Neumann series (8) and (10) into 
corresponding Neumann series for <|7'|?> and <|R|?>. By the angular brack- 
ets we mean the ensemble average, 1.e., the average over all realizations of 
Ae(2). More explicitly, if f(x) is a function of the point « of the (discrete) 
sample space S, then 


a)» = Y p(a)f(a), 


ains 
where p(x) is the probability of the event corresponding to «; in our case, 
S is the space consisting of the 2” possible outcomes of a sequence of N inde- 
pendent Bernoulli trials, and p(x) has the value 27” for every « in S. Thus, 
multiplying (8) and (10) by their complex conjugates and ensemble averaging, 
we find 


kA 2n 
(14) <IT| 2» = 1 + > FF £ DE [atri 3 Tan) FA (114.005 Tan |< BC 11) ++» B(Ten)> + 


seer Ton 


n=i\ 2 A 


TEA 
+£( a} 20 Dre Darren APT) BT) Bales 


t,m>0 


and 


CRD = SENS NS Son, I] 
CB) «= BCD) + BG) 


l,m>0 


where we suppress the index «, and make use of the fact that averages of 
products of an odd number of the random variables f(r) vanish. It is easily 
verified that the sums of the terms in (14) or (15) which are of a given order 


in Ae are real. Since 


(16) |KB(11) «+ B(r1) B(81) + Bl8m)> | <1, 
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we have the inequalities 


< kNdAe\" 
(17) (IPP) <1 +5 On +1) (5°) 
and 

l kNdAe\" 
(18) (|B) < > (2n — 1) | e 3) 1 


Just as one would expect, these series converge if and only if the condition (13) 
is satisfied. 

We shall now show that, because of the random nature of the scattering 
medium the convergence of the series (14) and (15) is usually more rapid than 
one would infer from the convergence condition (13). To show this, we must 
first prove an auxiliary inequality. 


Basic inequality. — Let c(r,,72,...,7,) stand for either the coefficient 
A171, 725, Tr) defined by (9) or the coefficient (T1; T2, +++) %n) defined by (11). 
Let B(r), 1<r<N, denote N independent random variables, each of which 
takes the values — 1 and +1 with equal probability. Then, if 7,m>0 and 
l+m=2n, 


(LOI DAI 2143 Ta) *(81.--58m)B(1)--B(71)B(81)--B(8m)d | < AXES BOY"). 


Pal Ty 845 oon 8, 


Proof. — Writing the quantity 
(20) <P(r1) «++ B(r) B(81) «++ B(8m)> 


in the form <("(1)f®(2)... B°x(N)>, where e,, €, :.., e, are suitable non-negative 
integers with sum 2», we see that (20) is non-negative, since it either vanishes 
(if any of the e, is odd) or is positive (if none of the e; is odd). Moreover, 
by (12), the coefficients Oise PE) OTB, 5 asp Ba) appearing in (19) all have 
absolute values < d2". The inequality (19) then follows from the obvious 
fact that if A, B,..., X are non-negative and if [a], |b|,..., |el< È, then 
|aA+bB+...+@X|< C(A+B+...+X), 
and from the identity 


SI > <B(11) + B(1) B(81) ver P(8m)) = <S Bir) P™>, l+m=2n. 


Pire TL Spe Sm 


Finally, we note that the inequality 


(21) DI AT y +». Tan)KB(r1) «+» B(r2n)) |< ances B(r)]}?") 


Les Ton real 


can be proved in just the same way. 
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3. — The stochastic majorants. 


The qualitative idea behind the considerations of this section is the fol- 
lowing: As shown by (13), the requirement AN = kNdAe/2 <1 is sufficient. 
to guarantee the convergence of the Neumann series (14) and (15) for<|7'?) 
and <|f|?>. On the other hand, it is clear that AN is not an appropriate 
quantity to use in estimating the rate of convergence of the series (14) and (15). 
To see this, we observe that we have not taken into account the random 
nature of the scattering medium in deriving (17) and (18), except, of course, 
insofar as we have used the fact that expectations of products of an odd 
number of B(r) vanish. Indeed, by writing 


(16) POU Be) "hs. <1, 


we have automatically eliminated any effect that the statistics of the problem 
might have in attenuating high order scattering. Since, when N is large, most 
of the realizations of Ae(z) exhibit many changes in sign, one might expect, 
to take a rather extreme example, that the internal field produced in the stack 
of slabs by single scattering is less than what it would be if the profile had 
the constant value Ae throughout the stack of slabs (*). The rather loose 
phrases « coherent scattering » and «incoherent scattering » are ordinarily used 
to distinguish between these two situations. We shall now apply the lemmas 
of the preceding section to derive majorant series for the absolute value of 
the error committed in omitting terms of order > 2n, in the series (14) and 
(15) for <|T|?) and <|R|*>. These majorants take into account the random 
nature of the scattering medium, i.¢., the oscillatory nature or « wiggliness » 
of most of the realizations of Ae(z), and will accordingly be called stochastic 
majorants. We shall find that, although the condition AN < 1 is still needed 
to ensure the convergence of the stochastic majorant series, a fact which is 
perhaps surprising at first, the rate of convergence of the stochastic majorants 
is usually much faster than would be inferred from the size of AN. 

The derivation of the stochastic majorants follows almost immediately 
when, instead of the crude inequality (16), we use the more refined inequa- 
lities (19) and (21), proved in the preceding section. Thus, except for the 
factor (k Ae/2)2", the term of order 2n in the sum (14) is the sum of 


(22) DI (AR taime. KP)" E) 


(*) In our case, there is probability 2 of this occurring. 
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and 
(23) 2 D ZO A*(813 +++ Sm)<B(11) +. B(r1)B(81) + B(Sm)> - 


Using (21), we see that the absolute value of (22) is bounded above by 


2d"<[> B(@)}?"). 


r=l 


Moreover, using (19), we see that the absolute value of (23) is bounded above by 


(2n — 1)" LE B(r)P"). 


r=1 


It follows at once from the form of (14) that if Z,(2n,) is the error committed 
in neglecting terms in (14) with orders > 2%, then 


(24) <3 (2n 41) 


n=No 


i <B> 0<m< 09, 


where the zero-mean random variable f, is defined by 


N 
(25) By= > BO), 

r=1 
1.e., Py is the sum of N independent random variables, each of which takes 
on the values — 1 and 1 with equal probability. The distribution of B, is, of 
course, the familiar binomial distribution of probability theory, for which 


(26) Prob {By = N — 2r} — (7), O0<r<N. 


The difference between the right hand side of (24) and the corresponding 
terms of (17) is that we have 


A 

(27) (Ba) = 2 ÿ(,) (N — 2r}°", 
in place of N?". Thus, the crudeness of the estimate (17) consists in replacing 
<BN> by the value N2", i.e., in failing to take account of the fact that the 
random variable 8} takes on its maximum value of N?» only with the pro- 
bability 2'-*, which is small when the number of slabs is large. 

In just the same way, using (15), (19) and (21), we find that if H,(2n)) is 
the error committed in neglecting terms in (15) of order > 2%, then 


(28) | Bp (2ro)| < Ÿ (2n—1) ES (87%, 1< gi 


nano 


MULTIPLE SCATTERING BY A RANDOM STACK OF DIELECTRIC SLABS 635 


which again differs from the corresponding terms of (18) only in the important 
respect, of having <B> in place of N?". Since the majorant series (17) and (18) 
do not take account of the random nature of the scattering medium, we shall 
refer to them for brevity as the sure majorants, using the word sure as an 
antonym for stochastie. 


4. The binomial moments. 


To proceed with the analysis of the stochastic majorants, we must first 
study the behavior of the binomial moment <6Ÿ> as a function of N, the 
number of slabs, and 2n, the order of the moment (the odd moments vanish). 


For convenience, we introduce the scaled random variable. 


US 


Pa = AB, ; 


and deal with the corresponding scaled binomial moment 
Sin FANS ser 
<BR> = cubo = ((p,)"), 


rather than <6"). It follows from <(AB,)*™) < (AN) and AN <1 that 
<p2"> approaches zero with increasing n, unlike <6"). By (27), we have 


A NIN 
(29) dm = 2S (7) [aur — an), 


obviously a monotone decreasing function of 2n. It is to be expected that 
the behavior of <f2") depends on the relative size of 2n and N. 

Qualitatively, the conclusions which we shall establish in this section are 
the following: When 2n< N, dre can be estimated by the corresponding 
(scaled) moment <g*"> of g,, the zero-mean Gaussian random variable with 
the same variance; this is reasonable in view of the central limit theorem of 
probability theory. As 2n approaches N, i.e., as the order of the moment 
approaches the number of slabs, this approximation breaks down completely, 
and we must use (29) to find de) Indeed, for fixed N, the quantity <g> 
instead of going to zero becomes infinite as n increases. When 2n > N, the 
scaled binomial moment <f?" can be estimated by the quantity 2'"(AN)??, 
which goes to zero exponentially as n increases. However, as is apparent 
from this exponential estimate, the condition AN < 1 is still needed to ensure 
the convergence even of the stochastic majorants. 

We now discuss these matters in more detail. 
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41. The Gaussian estimate of ie? — To obtain the Gaussian estimate 
of CB, we consider the zero-mean Gaussian random variable g, with the 


same variance as f,, é.e., with variance A*N. The Gaussian moments are 
given by (?) 


(30) 4g? = 1-3... (2n—1)A*N®, 


N 


while all odd moments of g, vanish. Moreover, we have the inequality 
(31) Cpt LEGS, eee NET ae Ola i soa 


which means that the Gaussian moments are upper bounds for the binomial 
moments. To show this, we first observe that 


fre pw); 


where È), 1<7r< N, is a sequence of independent Bernoulli random va- 
riables, each of which takes the values — A and +A with equal probability. 
We then write 
N 
In = > G(r) ; 
rel 
where g(r), 1<r < N, is a sequence of independent Gaussian random variables, 


each with mean zero and variance 4?. It follows by the multinomial expan- 
sion that 


(32) Br = > 


Mises 


(2n)! 
my Ma DA 


<B™(1)> ... <B™™(N)> , 


a 2n)! 
(33) gr = > Ri i <g™(1)> ... ga (N)), 


mont My mil... My! 
where the m,, 1<r<N, are arbitrary non-negative integers subject to the 
condition m,-+m,-+...4+ my, = 2n. Since all the moments of B(r), g(r), 1<r<N, 
are non-negative, the desired inequality (31) follows at once from a term-by- 
term comparison of (32) and (33), together with the obvious inequalities 
<B> = A™<1-3... (2n —1)A™ = Cg , 
(Bim) = 0 <0 = Gin, 0<n< 00, 


where B denotes any of the È) and g denotes any of the g(r). 


(*) H. CRAMÉR: Mathematical methods of statistics (Princeton, 1946), p. 212. 


= 
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Applying the DeMoivre-Laplace limit theorem (3) (the special case of the 
central theorem appropriate to a sum of independent, identically distributed 
Bernoulli random variables), we would expect to find that de is approximately 
Gaussian, with mean zero and variance A?N, to an approximation which im- 
proves as N increases. The difference between the distribution of DI and that 
of gy is greatest in the tails of the distributions; in particular, the values of 
(A are confined to the interval (— AN, +AWN), whereas g, takes on values 


outside any finite interval. Thus, for fixed N, we can expect <g%"> to be- 


2n 


come a poorer approximation to <f,"> as n increases. In fact, for large n we 
can use Stirling’s formula 


n!=V2xn"*"?texp[— n] 


to derive the approximation 


€ 2 n 
(34) cof) va (EST 
It follows that ultimately <g¥"> goes to infinity as n increases, whereas, as 
already noted, <p" must go to zero as n increases. (We assume, of course, 
that the condition AN < 1 is satisfied). 
Using (34), we easily see that the min- 1 
imum value of <g% is obtained for 


di 


(35) 2 ~ AN? 


and is approximately 42 exp [— n,]. 
These properties of the Gaussian esti- 
mate of <8") are illustrated in Fig. 2, 
an 


where we have plotted <p") and <gY > 
for the case N=20, AN=0.99. It ™ 


10° 
Fig. 2. — The solid curve represents the 
scaled binomial moment <87) as a function 10°. —Exponentia 
of 2n, for N=20 and AN= 0.99. The Ce 
dashed curves give the Gaussian and expo- : 

10° 


À, list 
nential estimates of (fy. 0 10 20 30 40 50 60 70 80 90 100 10 120 


(3) W. FELLER: An introduction to probability theory and its applications (2™4 edition), 
(New York, 1957), p. 172. 
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will be noted that the Gaussian estimate of <B2"» is good for n<10, and 
then deteriorates rapidly. The quantity <g%> is a minimum for 2», = 
= 20~ (A?N)-1, in agreement with (35); it then increases rapidly and ulti- 


mately diverges. 


42. The exponential estimate of <B>. — When 2n>>N, we have the ap- 
proximation 


Bey = 2 Any, 


which improves as increases. To see this, we examine the terms of the 
sum (29) in the limit as n + co. Thus, for example, the ratio of the tail 
term corresponding to r=0 to the term corresponding to r —1 is 


it N 2n 
VN 21 


which goes to infinity as n — co. Applying this argument to the other 
terms in the sum (29), it is clear that as n increases, most of the contribution 
to <p") comes from the two tail terms corresponding to r=0 and r= N, 
which in the limit of infinite n completely dominate the other terms of the 
sum (29). It follows that for large n 


5 N N 
Br» ~ (9) (AN) + 2) (AN) = 2A), 


as asserted. In particular, the condition AN <1 is still needed to ensure 
the convergence of the stochastic majorants. 

The exponential estimate 2'*(AN)? is drawn in Fig. 2 for the case 
N=20, AN=0.99. We see that in this case {B®"» becomes asymptotic to 
its exponential estimate for 2n = 70. The portion of the <B> vs. 2n curve 
for which neither the Gaussian estimate nor the exponential estimate is good 
must be filled in by using (29). 

Tables I and II give values of <p:" 
preciable, for N = 20, 35, 50 and AN 
in estimating the strength of multipl 
section. 


>, in the ranges where they are ap- 
= 0.50, 0.99. The use of such tables 
e scattering is discussed in the next 
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TABLE I. — The scaled binomial moments Bes, for the case AN =0.99 and various values of N.. 


The numbers in parentheses are 


A 
the corresponding Gaussian estimates of 46%». 


2n N= 20 NE N= 50 2n N= 20 
.049005000 .028002857 .019602000 
2 38 .000001 830 
(.049.005000) | (.028002857) | (.019602000) oe 
i .006 964321 .002307671 | .001137346 i 
(-007204470) | (.002352480) | (.001152715) =) 000001058 
; .001593455 | .000310846 | .000108507 È 
(.001765275) | (.000329381) | (.000112978) 42 -000001 571 
.000492723 | .000057478 | .000014296 
3 (000605551) | (.000064565) | (.000015502) 44 -000001476 
.000188968 | .000013395 | .000002389 
i (.000267075) | (.000016272) | (.000002735) 26 -000001 398 
000085391 | .000003740 | .000000481 ; 
12 (.000143968) | (.000005012) | (.000000590) 48 -000 001 332 
.000043934 | .000001209 | .000000113 ; 
14 (.000091717) | (.000001825) | (.000000150) 50 -000 001276 
.000025112 | .000000442 | .000000030 7 È 
16 (.000067419) | (.000000766) | (.000000044) 52 -000001227 
000015654 | .000000179 | .000000009 
15 (.000056166) | (.000000365) | (.000000015) 54 -000 001185 
000010490 | .000000080 | .000000003 : 
20 (.000052296) | (.000000019) | (.000000005) 56 -000 001 147 
22 000007469 | .000000038 | .000000001 58 000001113 
24 000005597 | .000000020 e 60 -000 001 081 
26 .000 004 380 .000000011 less 62 .000 001 053 
28 .000003555 | .000000006 than 64 .000 001 026 
30 -000002977 | .000000 004 10-9 66 .000001001 
32 000002559 | .000000002 = 68 .000000978 
34 .000002248 | .000000002 = 70 .000000956 
36 .000002013 | .000000001 A 72 000000 934 
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TABLE II.— The scaled binomial moments es for the case AN —0.50 and various values of N. 
The numbers in parentheses are the corresponding Gaussian estimates of <7» 


2n N= 20 N=35 N= 50 
a .012500000 .007 142857 -005 000000 
i: (.012500000) (.007142857) (.005000000) 
.000453 125 .000150 146 .000074000 
4 (.000 468 750) (.000 153061) (.000075 000) 
.000026 445 .000005 159 .000 001 801 
6 (.000 029 297) (.000005 466) (.000001 875) 
.000 002 086 .000 000 243 .000000061 
8 (.000 002563) (.000 000 273) (.000 000 066) 
.000000 204 .000000014 .000 000003 
10 (.000000 288) (000000018) (.000 000 003) 
x .000000024 -000000001 uan 
12 (.000000040) (.000000001) 
.000000 003 
A =o a 
14 (.000000006) less than 10 


5. — Analysis of multiple scattering. 


The application of the results obtained in the preceding section to the 
problem of multiple scattering in a binomially distributed, random stack of 
dielectric slabs is straightforward since the terms of the stochastic majorant 
Series (24) and (28) are just the binomial moments (pes multiplied by the 
growth factors 2n+1 and 2n — 1 (*). To illustrate the situation, we plot in 
Fig. 3 the general term (2n — 1) "Sha of the stochastic majorant series (28) 
for |Z,(2n)|, for the case AN = 0.99, N= 20. Comparing Figs. 2 and 3, We 
see that in both cases the terms at first fall off rapidly with increasing n, then 
fall off less rapidly as 2n approaches N ; and finally approach an asymptotic 
curve corresponding to the exponential estimate of Rae The exponential 


(*) These factors represent the number of terms in the Neumann series for T(x) 
and R(x) which combine to give the term of order 2n in the Neumann series for 


IR?» and <|T'|*>. 


ee d. hé. 
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estimate of the general term of the sto- 100 = T 
chastie majorant series (24) is given by | a . 


— 


Sure majorant 


(36) (2n —1)<B™>~(2n—1)2)-*(AN)™. 


The quantity (36) is a maximum for J 


al 
ON le 10! 
ss log, AN 
For the case AN = 0.99, this maximum 10? 
occurs at 2n — 1 — 100 (see Fig. 3). * major ani 
For comparison, we also plot in Fig. 3 3 


the terms of the sure majorant series 
(18), which, it will be recalled, differ 
from the corresponding terms of the 0" 
stochastic majorant series (28) by having 
(AN)?" in place of Cp Thus, except oti Sa, si 
N ? 0 20 40 60 80 100 120 140 160 180 200 220 240 
for the factor 2*-*, the exponential 
estimate of the terms of the stochastic . Fig. 3. — The solid curve represents the 
majorant series is the same as for the stochastic majorant series (28), when 
sure majorant series. This is apparent N= 20 and AN— 0.99. The dashed ours 
from Fig. 3, where the curves represent- Va na Puro, alora eer 
; and the exponential estimate of the 

ing the terms of the sure majorant stochastic majorant series. 
series and the exponential estimate of 
the terms of the stochastic majorant 
series differ only by a vertical translation. The appearance of the terms 
of the stochastic majorant series (24) for | Z,(2n,)| differs only slightly from 
that of the stochastic majorant series (28) for |H#,(2%)|, due to the close- 
ness of the growth factors 2n—1 and 2n+1 for large n. 

It is now a simple matter to estimate the error committed in neglecting 
multiple scattering greater than a given order. We illustrate the general pro- 
cedure by two examples. 


51. Mean square reflection coefficient: AN = 0.99, N — 20. — As we have 
seen in Sect. 4, the absolute value of the error H 2(2%) committed by neglecting 
the terms of order 2n, and greater in the Neumann series (15) for <|R|?> is 
bounded above by the sum 


(37) Sn —1) BPS 


N=Nog 


To evaluate (37) for various values of n,, we use the data of Table II, to- 
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gether with the estimate 


foe) 


(38) S (2m —1)<B2"> ~ 217 "Ss (2 In — VAN)". 


n= 35 n=35 


The use of (38) is justified by the accuracy of the exponential estimate of 
<p» for 2n > 70. It is not hard to see that the right hand side of (38) equals 


69 — 67(AN)? 


1-¥( A N)'° (1 — (AN) = .000794 , 
whence |Z,(2n,)| is bounded above by (*) 
34 
(39) B(2n) ~ > (2n — 1)<B2"> + .000 794 . 


n =" 


The data presented in Table III were obtained by using (39). 


TABLE III. — B(2n,) is an upper bound for the absolute value of the error committed by 
neglecting terms of order >2n, in the Neumann series (15) for <|R\*>, for the case 
AN= 0.99, N=20. 


2 B(2n5) 2n4 | B(2no0) 
| 
4 .039 20 .003 
6 .018 30 | .002 
8 .010 40 | .002 
10 -007 50 | .001 


It is easy to calculate explicitly the first term R, of the Neumann series 
(15) for <|R|*>, since it is just the mean square absolute value of the random 
variable 


Na 
(40) dé exp [2ikz] Ae(z) de 


0 
The integral in (40) reduces to 


rd 
N 4 La N 
D Ae(r) | exp [2ike] de = Re > Ae(r) exp [ik(r —1)d] , 


(r-1)4 


(*) To ensure that (39) is an upper bound, one could multiply the second term 


in the right hand side of (39) by 1.1, say. This would not affect the data of Table IIT 
to the accuracy given. 


NE adi _4 


MULTIPLE SCATTERING BY A RANDOM STACK OF DIELECTRIC SLABS 643 


whence 


(41) Bi = Ain (5 ay 


kd 


On the other hand, the stochastic majorant for R, is, of course, the quantity 
A?N. It follows that R, is close to its stochastic majorant if kd<1, i.e., if 
the slab width d is small compared to the wavelength of the incident radiation. 
Since in the present case A?N = 0.049005, Table ITI shows that the first 
term in (15) and the absolute value of the sum of the next few terms may 
well be of comparable size. Indeed, if kd is large, we cannot even guarantee 
that single scattering predominates. 


5.2. Mean square transmission coefficient; AN =0.50, N=250, A2N =10-3, 
— We begin by showing that the second term 7, of the Neumann series (14) 
for <|7'|?> is non-positive. It follows from (6) that T, is the sum of the two 
terms 


ke no |2 
+ Feel) 


and 
Nd Na 


(43) — PS [ [exe [— ike] exp [ik|e, — 2, |] exp [tkze,] Ae(z,) Ae(z,) de, dz, ) 


+ complex conjugate . 


Since 
Na 
N 
Jace dz = d > Az(r), 
: Fpl 
(43) reduces to 
kd? N N 
(44) 1 <> Ae(r) > Ae(s)) = AN. 
Ta Sud: 


The integral in (43) need only be evaluated for the case where 2, and 2, both 
lie in the same interval from (r—1)d to rd, since otherwise the integral van- 
ishes. Doing the integration, we find that (43) reduces to 


(bt. — nr = (exp [2ikd] — 1) x (Ae(r))?)+complex conjugate = 
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Combining (44) and (45), we obtain 


which is non-positive. The quantity T, is bounded in absolute value by its 
stochastic majorant A?N, and is approximately — A*N for small values 
of kd. Furthermore, we note that the sum of 7, and the quantity È, given 
by (41) vanishes, a result which might be expected, since the sum of eu 
and <|R|?> must be unity by energy conservation. 

We now estimate the error committed when we approximate the mean 
square transmission coefficient by 


sin kd\? _ (sin kd\? 
(46) (|P 214 7, = 1— AN | Do J=1—10-( na |: 


thereby neglecting all terms of order > 4 in the Neumann series (14). To 
do this, we first sum the terms of order > 2y in the sure majorant series (17) 
for <|T{?), obtaining 


(AN)?” 
(1— (AN)?)? 


(47) x Qn + 1)(AN)" = [(2v +1) + (1— 2)(AN)2]. 


For the value AN — 0.5 under consideration, and for v= 15, this sum is 
approximately 10-8, which is an upper bound, albeit a very crude one, for 
the terms of order > 30, which were omitted in writing (46). The rest of the 
terms omitted in writing (46), with orders from 4 to 28, are bounded above 
by the series 


14 
(48) > (2n +1)<gx">, 

n=2 
î.e., by the Gaussian estimate of the corresponding terms of the stochastic 
majorant series (24), which, since N = 250, should be quite a good estimate, 
Using (30), and then adding (47) and (48), we easily find that for the case 
AN =10~* the absolute value of the error committed in using the approx- 
imation (46) is less than 1.6:10-5, say, which is only a small percentage of 
|T.| if kd<1. 


6. — Conclusion. 


We have seen that the absolute values of successive terms in the Neumann 
series (14) and (15) for the mean square transmission coefficient and the mean 
square reflection coefficient may fall off rapidly with increasing n, even if the 


4 — 


eS tina 


oe 
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convergence parameter AWN is close to unity. Indeed, an examination of the 
formula (30) for the Gaussian estimates of the binomial moments shows that 
the smallness of AVN (rather than of AN) is enough to ensure that the terms 
of the series (14) and (15) fall off rapidly in absolute value, at least until the 
order of the terms is comparable with the number of slabs N. In particular, 
if kd< a, then for sufficiently small AVN, the formulas 


(RP +R, CTP) = I ET, 


corresponding to the neglect of terms of order higher than two in the Neumann 
series (14) and (15), will be good approximations, even if AN is close to unity. 
(Note that AN~1 and AVN <1 imply that N >1.) One might adduce 
this example in support of the general belief that in the problem of radio 
scattering beyond the horizon, the large number of «scattering blobs » within 
the common volume of the antenna beams helps to justify the conventional 
use of the single scattering approximation for the scattered field. 

When the order of the terms is greater than N, we find that we must again 
use the size of AN, rather than the size of AVN, to estimate the rate of con- 
vergence of the series for <|7|?) and <|R|®). Qualitatively speaking, as n 
increases, the random nature of the scattering medium becomes progressively 
less effective in attenuating scattering of order n; finally, when n becomes 
large compared to the number of slabs, the randomness of the medium plays 
no further role in attenuating multiple scattering. 
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CoNTENTS. — 1. Introduction. — 2. Preparation of 9*(HP) for the appli- 
cation of MC. — 8. The MC approach. — 4. The computing programme. 
— 5. The spectrum. — Appendix. 


1. — Introduction. 


In the region of primary (lab) energies of the order of 1 to 100 GeV, the 
number of particles (mesons, heavy mesons, anti-nucleons, hyperons) created 
in a collision is of the order of 1 to 10. Consequently, pure thermodynamical 
calculations fail and if one wishes to apply a theory like Fermi’s (:) one has 
to calculate phase space integrals. We do not consider here the physical side 
of the problem, i.e. the validity of this type of theory. We only try to solve 
the mathematical problem of calculating the phase space integral, which has 
not been achieved in a simple and reliable manner so far. The integral, for 
n emerging particles with total energy Æ and total momentum P, is given by (?) 


(1) on(E, P) = ah fap, … dp, 6(P — > pi) ÔÜ(E — Vp} + mi) . 


This integral does not depend on the direction of P. In the center-of-mass 
system (C.M.S.) we have P= 0. In what follows, we shall consider frequently 
(1) E. FERMI: Progr. Theor. Phys., 5, 570 (1950). 


(*) The phase space integral is of course a function also of m,, m,,...m,. AS a 
shorthand notation we write o,(Æ, P) instead of Omoa (E P). 
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instead of (1) the unnormalized phase space integral oŸ(Æ, P) 
(2) on (E, P) = (2xh)*"-*-0,(H, P). 


The main difficulty in all attempts to perform the integrations have been 
the square roots Vp?+m?. The exact integration is possible only up to n= 3, 
for higher numbers of particles one has to integrate numerically. In principle 
this can be done by repeated convolution since 


(3) o(B, P) = J ina eV oh e 


and this is in fact the way BLOCK (*) has done the first few integrations. But 
for larger numbers » the repeated numerical convolution becomes either very 
inaccurate or prohibitively long. Several authors have tried approximations (*), 
but in many cases it is difficult to estimate the error. In unfavourable cir- 
cumstances, the result can be wrong by a factor five, even if for other cases 
the error is only some 10%. 

We may hope that by keeping the square root (instead of approximating 
it by either p; or p°/2m; as in most of the other calculations) but using a Monte- 
Carlo method (M.C.) for the integration, we may achieve an accuracy of a 
few percent. The method has the advantage that it gives in the course of 
calculating 0% (£, 0) automatically the energy spectra of each kind of particles. 

Exact spectra and phase space densities should be useful in extracting from 
future experiments a suitable representation of the matrix elements of inter- 
action by simple functions of energy, numbers of particles, etc. These functions 
(the «interaction volume » in the simplest case) may be considered as para- 
meters of the theory whose order of magnitude and general behaviour can be 
concluded from physical arguments but which still remain somewhat arbitrary. 
We feel that such a theory is not very satisfactory, but it may, as a half-empirical 
procedure, lead to good qualitative predictions about the reactions of elemen- 
tary particles between 1--100 GeV. Questions of this kind have turned out 
to be interesting since accelerators are under construction, which will yield 
protons of 25 and more GeV. 

In all that follows, we treat the particles as if they were distinguishable 
and consequently each one has a «label»: 1, 2, ..., n. In any application one 
has then to correct 0, by suitable factors (5). 


(3) M. M. Brock: Phys. Rev., 101, 796 (1956). 

(4) a) R. MILBURN: Rev. Mod. Phys., 27, 1 (1955); b) S. BELEN'K1J et al.: Usp. Fiz. 
Nauk, 62, 1 (1957); c) G. E. A. FraLHO: Phys. Rev., 105, 328 (1957) and various other 
papers quoted in a) and b). 

(5) Y. Yervin and A. pe-SHALIT: Nuovo Cimento, 1, 1146 (1956). 
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2. — Preparation of o; (E, P) for the application of MC. 


First of all, we do as much analytical evaluation of the integral as possible 
and apply MC only at the end. | 
We put 
Pi = Dit: 5 le:| = 1 
dp; = pì dp; de, = pì sin 0, d0; dg; dp; 
and have 


(4) em, P) = 


|. fan. dp, p? ... p2 6(H — Sv p? + m)-.[ > pie:)de,... den. 
i=1 i=1 


all directions 


In this way the whole integral is split into two parts. : 
The function 4 
Lx ; 

(5) WAP, Dis: Pa) = (=) fo@- 5e) de, ... de, | 
/ ì 


has a very simple meaning: w,(P, p, … p,) AP is the probability that for given 
© Pis Pr, but random directions e, ... e, the resultant vector Y p;e; lies in the 
neighbourhood dP of P. It is normalized such that 


œ 


(6) foste, Pi Dh)dP= ax] Pr, (P. D: Pa Pi 


0 


and it depends in fact only on |P|. We shall consequently refer to this func- 
tion as the «random walk function». An explicit derivation of this result 
and some further properties of the function w,(P, p, ... p,) are found in the 
Appendix. 

Introducing now the energies by 


(7) & = Vp? + mì, pie) =Vet— mi , 
and defining 
TEMERE] a; 
(8) ule) = va as sE a ; v,(P, E... En) = WE, Pi... Pn) ’ 


we may write (4) in the center of mass system (CMS) as 
(9) o5(E, 0) = (4x)"| de, …| dents (€1) ... Un(En) v2(0, 6: … En) Ô(E — De). 


n 
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A further transformation from the total energies to the kinetic energies 


(10) t= &— Mi ; m= M ; E-M=T 
gives 
any | 20 0) = tn clare dt, Dit, ... ,) (T-X), 


Dit) = ld a) Unltn + Mn) Vn(O, t+ m,,... ty + Mn) . 


A last transformation gives the final form to which MC will be applied: 


| t, =T, 
(12) Rent or, 
| ty = re Tia 


leads to 


o*(E, 0) = (4x) sr fen far. Jar Jane Ti; JTD); 


where now the 6-function drops out by integrating over T,: 


m IT ch 
(13) o*(E, 0) = (me far, far,...far, our, PETTINEO 


n—2 


3. — The MC approach. 


649 


We consider the following process: We choose independently and at random 
n — 1 numbers between 0 and 7 in such a way that the probability density 
for the single numbers is constant over the whole interval; we then order 


them according to magnitude. 


With N such ordered samples (Ti, TS... TI); (0(=1... N) we have 


D IZ T 
(14) Jar jar, Le OTT.) = 
0 Ti Ve 


n=l 


ceci — lim — 1 so CNIL 


(n —1) lwo NA 


) 
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È 
To prove the truth of (14), we note that the probability that the é-th ordered es 
sample fills just the cells A7,, AT.,..., AT, is (see Fig. 1) 


AT: A AT, AT,_1 = (n PL 
. DT T wee T [n-1 


! 
(n — 1) AT INTERNE 


and N times this probability is the total number of ordered samples filling 
these cells — they all lead to the same value of g. Hence (14) holds. 

This is the principle —it remains to 

write the formulae in a more convenient 


AT RÉAL AT Pipe toni 
pee PEUT, ha 7 It should be mentioned that one may 


hope for a fairly rapid convergence, 
since the sampling process has a ten- 
dency to draw preferably samples with 
T,~ T,—T,~...~ T—T,_, because for large n the points 7)” cluster around 
the value (i/n)T. Fortunately, it happens that the integrand has a maximum 
for those values of the variables that nearly correspond to equipartition of 
kinetic energies. Thus the samples tend automatically to pick out the most 
important terms of the sum and to neglect the small ones. This is the main 
advantage of MC applied to this problem as compared to a simple multidimen- 
sional numerical integration. The problem of convergence will be treated in 
the following paper. 


Fig. 1. — For the proof of (14). 


4. — The computing programme. 


From (14) we have with (13) 


*(77 o)” ne o 1 € i î i 
(16) ox (E, 0) = (n — 1)! 4 ste lim yz DOES: Telaio D) , 


where 7°... 7), is the i-th ordered sample. Going back to the variables &,, 
each ordered sample 7... T°, yields a sample ¢\”... el: 


“1 n* 


(17) (ee) LE, TO — TY +m, .., L— TO + m,) 


and 


O(T,, Trio T—T,_1) = 6, Vei— mi? ... 8, Ve — Mi -4(0, 86. En) - 


(*) The ordering is responsible for the factor (n — 1)! 


n 
2 
n 
Le ra 
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For v, we take the form (cf. A.9)) 


À 


Oe Bass Oy) == ana. 


Il tie kg ies 
Van. Ver mi = 5a a. [a ava n) oi Vez — mò) i 


n LO20O»n 


We have therefore 


ca di 
DIECI TT data ine! Pr(E1--- En) 
with 
(18) Yn(E1... En) == Eo end Og LAT (> o,Ve? — mè mi He (> oNe— m? 2)" È 


With this definition 


E— M 2 
( jte se a)" FES RD tale ae. Ge se) 5 


(19) ox (Æ, 0) = (n —1)! (n—3)! No N 


The samples e, e! ... e are constructed according to (17) and at the same 


time one calculates the corresponding set of square roots VER — m2, 


5. — The spectrum. 


Another advantage of this MC method for solving the phase space problem 
is that by the very nature of the computation process one gets easily without 
further calculations the energy spectrum for each kind of particle considered, 
more accurately: that part of the spectrum which is determined by phase space 
considerations. 

We consider here the (unnormalized) spectrum of the r-th particle, which 
because of the assumed individuality means a quite definite one. From (3) 
we see that in the C.M.S. (P= 0) the quantity 


(20) dp, 4x p?-0# 1(E — Vpi + m2, p,) 
is proportional to the number of particles «r» with momentum between p, 
and p,+dp,. 
The function 0 (E, 0, £,), defined as follows: 
(21) o(E, 0, e.) = Are, Ve? — m?-0,* ,(E —e,, Ve? mè) 


allows then a similar interpretation. 
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Now from (9) 


(22) of (H—e,, p,) = (any de, “| de. [de faes: 


my, My 


Uy (Ey) «+ Wr—a (Er—1) Weta (€741) +. Un(En) ‘Una (Pro €13 ces Er 19 Er413 vey En)” 


‘OE —e— Ve. 


i£r 


Furthermore with ¢, =p? + m? from (A.7) 
(23) Une1 (Pri Ex cos Epa EX ce En) = da (0,8: nai 


Since also 6(H —e,— > &;) = Ò(E— > e;), we see that the differential spectrum 
of particle «r », al o, (E, 0, €,) is obtained from (9) if de, is taken outside 
the integrals, i.e., if the integration 
over e, is omitted. From this follows 
then the almost obvious way to calculate 
On (E, 0, £7). i 

For each kind «r » of particle a set 
of Z storages is provided and the total 
energy # is divided into Z intervals 
Ae= E/Z, each cell corresponding to 
one storage. 

At the same time as the samples 
€... € are prepared, one looks in each 
sample for the value of e and determines in which of the cells Ae it lies. 
Then one adds into the corresponding storage the number y, (£” ... e?) (see (19)). 

After completion of this distribution of y’s (which takes place of course 
during the calculation of 0*(E, 0)) into the cells, the whole distribution is 
multiplied by 


(hr 
en (E.O.E-)A£ 


Fig. 2. — Typical form of a MC 
spectrum. 


if (E esi M)"-1(2x7)" 1 


N (n—1)!(n—3)! 


and the result is (2, 0, €), the spectrum for the particle «r». Indeed: this 
distribution means nothing more than the omission of the integration over 
de, and represents the integrand as a function of Exe 

The accuracy will of course be smaller than for 0) (EZ, 0) itself, because of 
the subdivision of the total statistical information, but one obtains without 
additional work at least a rough idea of the spectra. 

In a following paper we shall give details of the actual performance of such 
a calculation including estimates of errors and a flow diagram for an electronic 


computor. A brief report concerning our experiences in practical calculations 
will be added. 


ba à 
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APPENDIX 


The random walk function. 


Given a set of n vectors with fixed lengths p,... p, but random directions 


e... en. What is the probability w,(P, p,...p,)dP, that the resultant > pe; 
lies in the neighbourhood dP of P? i$ 
First we make two simplifying remarks: 
1) Since each e; has a random direction, the same holds for the resul- 
tant. w(P, p,...p,) can only depend on |P|. We write therefore w, (P, P1- Pa). 


2) The order in which the vectors are added is irrelevant, w, must be 
symmetric in p,...p,. Even more: the probability density for n vectors 
Pi---Pn to have a resultant P must be the same as for the n+1 vectors 
P, p,...p, to have the resultant zero. Hence 


Wn+1(0, Pre Pry P) == WEE, Pi... Da) 


and since the left hand side is symmetric in p, this must be true also. P 
for the r.h.s. 


We now derive Eq. (5). 
The required probability is obviously proportional to the number of pos- 
sible realizations, viz. 


(A.1) WAP pyc. De) = XK, (P — > pie:) de,... de, , 


where we have imposed the condition by means of a 6-function and integrate 

thus over all possible ways of choosing directions, which give the wanted 

length of the resultant. K, is a normalizing factor to be determined later. 
Introducing a Fourier representation for the 6-function 


rey = 
(A2) S(P— pe) = (2) [opta Spe) lar, 
we find 
he à 
UP, Aye De) = oma] de,.. de, exp [iA(P — Y p,e,) | 

or 

Li à e O a 
(A.3) NP, Dia) = fa exp[iAP]:TI | de exp[|[— ip;ke]|. 

Ik 


With polar co-ordinates de = 2x sin 0 dû one has 


4x sin p; 
Joe exp [— ip,;Ae] = aos i 
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Since the product does no longer depend on the direction of À we may integrate 
exp[iAP] also over all directions of A with the same result and find 


Le] 


Ke , sin PA sinp,A sinp,da 
W mn = EA n+1 2 . ia LÌ 
(A.4) WON Di. Da) Ca (4x) fi dA PR ni Di 
The normalization requires 
Po 
(A.5) lim free, Dice Dal ate APE, 
Pyro 


0 


(It is necessary to consider this limit, because we have reversed the order of 
integration and thereby obtained a singular function.) We obtain 
Po (ce) 
2 


SI LA, 3 e 
ta wl Pa ... Dn)P2 AP = Tay [Ae Nr) i ASE oe Pale 
0 0 


DIA V'Opel tin 


Now the second term in the bracket makes the integral vanish for P, > co 
whereas the first one gives 


Po +o 
K,(4m)"+? x 1 (,, sin AP, sindp, sin Ap, 
4 WCE, Di... Da) Pa dPi= = de A ia à 
"fw (P, Pio Pa) (2a)? 2 SIL A Ap, AP n 
0 — œ 
Since 
1 sin AP, 
lim — °- 
re IT À o() 2 
we find 


tal w,(P, Pie Dy)P2 AP = (4n)*-K,=1. 
0 


Hence the normalized random walk function is 


1 


F6) se CP, mm) = (a) [ow — ÿ pie.) de, ... de, = 


1 sin PA sin p,A sin p,À 
PI ALE À? di = eee - 
2702 Ph PiÀ Pad 
0 
We observe at once the complete symmetry not only with respect to: 


Di... Pn but also with respect to P, p,... pn, which all may be arbitrarily per- 
muted. A simple consequence is the important equation 


(A) Wn(0, Pa... Pn) = Wn-1(Psy Dr ++» Dia Dita ee Pn) 5 j = 1 


which we found already at the beginning from general arguments. 
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We now evaluate (A.6) for P—0 (which, because of (A.7), is no re- 
striction). Written with exponentials and the A-integration going from — co 
to co, (A.6) gives 


1 
w,(Q, Pie Po) = 4 p p 4 
1 eee n 
1\"( da | 
(ea) SR (exp Lin 4) — exp Lip) (exp Lied] —exp[— red) - 


Since the integrand does in fact contain no singularity, we could shift the 
path by — ir into the lower half plane. Evaluating the product and closing 
then the path in the upper or lower half plane according to where the integrand. 
vanishes, we find for the integral 


© — 27 


dA 
pee Do... on exp [iA > op] = 


— 0 —iT 


271 dr 
DE (n — 3)! dar ap 01... On EXP [td > OiPi Tapco : 


Here every o takes the two values +1 and — 1 and the sum is over all 2” 
sign combinations, except those for which Ÿ o;p;< 0. (Because for these 
the path is closed in the lower half plane with residual zero). The differentiation 
gives finally 


1 JL 
(A.8) w,,(0, Di... Pa) = Then reines Dido ora Date oe 


Palo i EE 


For practical calculation of the sum over o we remark that to every sign 
combination there exists the inverse one and one of them gives > CP: > 0. 


We need therefore sum only over one half of all sign combinations provided 
that we reverse all signs if > 0;p;< 0. We may therefore fix 0, = +1 and 


vary all the rest. This gives just one half of the possible sign combinations 
On... On and no two of them are the inverse of each other. Thus the whole 
class [o, = +1] is inverse to the whole class [o, =—1] and we shall now 
sum only over the former one. If for a certain combination 0, ...0, (01 = +1) 
one finds Y o,p;< 0, one has to change all signs 0, —>0,—=—0;, and to add 


oo op) (oso, (Don), 16,05 OP). 


That means: If Yo,p;<0, we have to subtract 01... 0, (> op.)""* instead of 
adding it. This is expressed simply by writing 


1 . 
(n—3)l21% 


(4.9). w,(0, D, Ds) = 


Pi. D 2 21 -+- On '88 (Yop) (Dop) 


O,=+1 
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where 


ne eg: for æ > 0 
ii ST for æ < 0 


is the sign function (*). 

An evident consequence of the meaning of the random walk function is 
that it must vanish identically as soon as any p, is larger than the sum of 
all the others: 


(A.10) wn(0, Pi Pr) =O if 2p, — > pi (for any j=1... n). 
t=1 


As this is not obvious in the present form (A.9) of the function, we prove 
it explicitely: 

Because of the symmetry one may take j=1. The square bracket of 
(A.9) must vanish. 


i) We observe that if 2p, >> pi, one has 
sg (Dop)=+1. 
ii) We expand the power in writing (>’ means î#1) 


2 02... On(Di + D' op)" = Y 04... on > [Pi (op). (o,p,)r] = 


o di, =n-3 


a is mts i PPS heres! es 
Dpr. pin Y oi +0841, gin tt, 


Lix=n-3 o 


ili) From Yi,=n—3 follows that not all 4,40. Let i;=0. Then 
k=1 
one has (}” means: 6/1 omitted) 


> = Dd 0;(V'oit1.., gint1) — 2 =D =0, q.e.d. 


o O;= +1 o 


The evaluation of (A.9) seems very easy on an electronic computer, if one 


(*) Ambiguities arise if accidentally a certain Dd op; = 0. In fact this gives a dif- 
ficulty only for n= 3, since in that case w(0, PiP2Ps) is a discontinuous function 
because of the triangle inequality. As a pure convention we put sg(0) = — 1. For 
n > 3 one has in such a case (> 0;p;)"-* = 0, which settles the question. In the pro- 
gramme of an electronic computor, however, this does not help the machine to know 
what to do with the sg(0), so we shall adopt the same definition for all ni 
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proceeds in the following way: 


n 
a) One calculates Dips which corresponds to 
j=l 


O71 


ilies 


I 


65 


= 


if 


b) One varies the signs in such a way that in the order 0303... 0, each 


o keeps its sign as long as possible (see example): 


ee I ll ——— 2 lil — ——— ———————Ù— — — 


DO te Oy, Oy CREA on O; Corresponds to 
| Sii | SE SE + Sh Pat Porto Pal Pare Ps 
a — + È ali = — 29; 
sE da ee = — 2Pa 
io — + + = = + 2%; 
Re ca nu a n — 2Ps 
+ A = > + — 2Ps 
| 4 + + a iù = + 2Pa 
ee + — + + + 2p, 
+ = % a Be 
— — = + oy — 2Ps 
+ AR = CASA — 2Pa 
- — = = + 2); 
x E ec =A a + 2s 
eae = da È A — 2Ps 
| = ni ai > + 2pa 
| — = Me x > + 2P5 


—— 


Change against foregoing set > 0; P; 


The example shows that from combination to combination there is always 
only one addition to do and the sign of o,-o,...¢, alternates systematically 
Besides the additions, the machine has to do the (n —3)rd power and to ob- 


42 - Supplemento al Nuovo Cimento. 


' 
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serve the sign of }o,p,. The total number of additions goes therefore roughly 
with 2-1 and after each addition (i.e. for each Y o;p;) the power is to be 
calculated, that is 2"-1 times a (n — 3)rd power. We are unlikely to have to 
calculate cases where n is bigger than 12, the normal situation will have » 
running from 3 to 10 (only mesons produced) and from 3 to 5 or 6 (heavier 
particles produced). For n=12 one has thus — 2050 additions and 2050 ti- 
mes a 9-th power. Cases with n=7 may be most interesting. This gives 
~ 150 additions and 150 times a 4-th power. 

Since this number of operations has to be carried out for each sample of 
Pi... Pr, the calculation of w,(0, p,...p,) contributes a great deal to the total 
computing time. 


SUPPLEMENTO AL VOLUME IX, SERIE X N.- 2, 1958 
DEL NUOVO CIMENTO 3° Trimestre 


A Monte-Carlo Method 
to Calculate Multiple Phase Space Integrals - II. 


F. CERULUS and R. HAGEDORN 
CERN - Geneva 


(ricevuto il 18 Agosto 1958) 


CONTENTS. — 1. Error estimates. — 2. Flow diagram. — 8. Practical 
experiences. — 4. Estimates of computing time. — 5. Comparison with 
the usual approximation. 


1. — Error estimates. 


In a foregoing paper (‘), which will be quoted henceforward as (I), we 
presented a Monte-Carlo (MC) approach to the problem of calculating mul- 
tiple phase space integrals. We shall use the same notation as in (I). 

The proposed calculation can be done only on automatic computers and 
of course only with a limited accuracy. As a MC calculation is a statistical 
process, one must know how accurate its result actually—or better: pro- 
bably—is. Automatic computers allow one to include a check of accuracy 
within the programme, so that the machine stops or goes over to the next 
case as soon as the prescribed accuracy is reached. 

We therefore begin with an estimate of the error. This error estimate 
applies to a single calculation of a particular o*(E, 0). Actually, however, 
one will calculate this function for n = 3, 4, 5... and also perhaps for different 
energies. Since we know that it is a smooth function with one single max- 
imum if we vary either n or Æ (keeping the other fixed), we may get a better 


(1) A Monte-Carlo method to calculate multiple phase space integrals (1). Equations 
of that paper are quoted as e.g. (I, 15), whereas equations of this paper are referred 
to as e.g. (12), ete. 
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statistical accuracy for a whole table, than we would have for a single case. 
If, e.g. for fixed Z, one calculates a table for n= 3, 4, ..., n', then the accu- 
racy of a number from such a table will be better by a factor of roughly 
1 n'— 3 provided the values in the table are smoothed out by using any 
method of balancing the fluctuations. 

For the estimation of the individual error we proceed as follows: 

First a certain number N, is fixed (N,>>1) which is the basic number 
of samples or the basic number of contributions to the sum (I, 19). This number 
should be chosen so small—say 10 or 50 or 100—that one does not expect 
that N, terms make already a good approximation to 0*. The process has 
then to be repeated several times and its convergence watched. Each repe- 
tition will add again N, contributions. Thus, if the desired accuracy is reached 
with 2 sets of N, contributions, the number of samples is N=AN,. We in- 
troduce the following abbreviations: 


(1) o/= 0. (71,0) i.e. the MC approximation to 07(£, 0) (2) 
with N=AN, contributing samples (see 
(I, 19)). We want this approximation to 
fulfil some condition of accuracy to be 
specified later. 


UN 


3 5 do 8°) i.e. the partial sum of the y’s from the 
(2) Sy oa Ha u-th set of N, samples 


2 


(3) Z, = DD = Sve SI) 


u=1 


i.e. the total sum of y’s from the N —AN, 
samples. 


With these definitions is follows from (I, 19) that, 


A : (E — M}"-1(2x)"-1 i a È La 
Fori” mn nn Ge) oa 


Consider first 2 as a fixed number and suppose that 0, has been calculated. 
To get an estimate of the accuracy we can use only the accumulated data 
from the AN, samples and we therefore deal with a posteriori estimates only. 

We can say what would be the mean error (more accurately its r.m.s. 
value (*)) if we would calculate 0, many times independently. In fact, we cal- 
culate it only once and may have found a value which differs much more from 
the exact value than just by one mean error. There is no way out of this si- 


(2) We omit throughout the star and write simply 0, and o 108 convenience. The 
qitaitities,: however, correspond to’o* as defined in (I). 
(*) r.m.s. means: root mean square. 
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tuation, because it is a logical consequence of the MC method, and common 
to all sampling procedures. Even if we would really calculate many times 
and take the mean value, then this mean value would simply be another single 
(though more accurate) calculation of 0, where now /’ is many times larger 
_ than 2 was. We can then carry through the same reasoning for 1’, which we 
gave above for 4, and we come to precisely the same result. Only the mean 
error will be smaller. 

Let us consider a typical situation. If 0, has been calculated, then this is 
the mean value of 2 contributions S , (all independent of each other), and each 
one of these comes from N, samples. We may here forget about the latter 
fact, because N, influences only the magnitude of the fluctuations of the S Pe 
We shall consider therefore the S,, as random variables. We draw a histogram, 
which shows this situation: 


is the mean value and it is supposed that 


; 07 OQ 
Ti es et 
ee 4 "A 


Actually we do not know this limit and the only quantities which are at 
our disposal are those which we can extract from the above picture. 
For the following we need two different mean values: 


i) The mean value over the À contributions which have so far been 
calculated. This mean value can be inferred numerically from the actual data; 
it will be denoted by a bar. 


ii) We may imagine the above actual case to be one of a large ensemble, 
i.e. we may imagine 0, to be calculated L times in the same way (statistically 
independent) and all the L histograms drawn. They will yield L values 09° 
(/=1... L). The second kind of mean value is then over the L-ensemble. Here 
we take the limit L + oo and write < >. 


The contributions S fi have from now on a second label 7 such that she is 
the u-th contribution to the /-th histogram. Let # be any function of S, 
then i) and ii) mean 


il À 
=. si ASE À always finite 
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Whereas FO still depends on 7, <F> is clearly independent of 4. Of course 


——(A) —(A) ‘ 
Hint- FD = EM ae PR 
Vendo) 
but we are just interested in the deviations for finite À. 
We proceed now to estimate the error of 0, as a function of 4. With 
lim 9; = <Q) =0= AS) we have 


Jo 


A c Ù Y Y Y 
ie = (3 8,—/4)* = 4D AS,AS,; AS, = 8, — <8). 


Vu 


Since all contributions are statistically independent, we find with (5) 


Nas 
FE 


A? 
“(07 REI 0) 7 5 ES <( AS) a eer (AIS? 
CENT, = ( 


t=1 
We now define (00), = V <(0, — 0)?) and have 


(eh _Vi(ai—e)) _ 1 V<AS*) 


0 
(6) 2 = = 
0 0 VAS 


In the following discussion we shall assume that the distribution of S, 
around S is more or less a Gaussian distribution. We use this assumption only 
for estimates of orders of magnitude. If N, is large enough, the fluctuations 
of the S, will be small and in the neighbourhood of <8) the Gauss distribution 
will be a good approximation. This has the advantage that we can calculate 
some quantities explicitly. Though the results do not hold strictly, they re- 
present at least good estimates. 


If <AS*> (and hence do = V {A0®) were known, we could state confidence 
limits for 0, starting from the calculated 0,. Assuming a nearly Gaussian distri- 
bution for $,, it is well-known that 

the probability that |e —e,|<de, is = 68% 
» ) ) le —e,l<2°00, is = 95% 
» » » lo—o,|<3-de, is = 99.950. 

Then, in order to obtain a given probability that the relative error is 
smaller than a given (small) number, we only have to choose the corresponding 


A large enough. Unfortunately neither <AS?) nor <S> are actually known. 
We may, however, without great error replace in (6) €S> by 8S”, because do/o 
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will be changed only by an amount of the second order. So we redefine 


(00); «cl VAS) 


6 a ae 


Zanni 
I 
_ 


There are now many ways to estimate (AS?) from the sample of 2 elements S,, 
e.g. using the largest occuring deviation or the r.m.s. deviation from the mean 
value. These methods, of which Student's test is the most adequate, have 
several practical drawbacks, the main one being that after each addition of 
a new set of samples the mean value changes and all deviations have to be 
calculated anew. 

These disadvantages may be avoided in the following way: We use the 
quantity 6S” which we define as follows: 


(8) és = 


That means: After each step of the calculation we take the mean value S‘” 
and find the absolute value of the deviation of the last contribution S, from 
this mean value. The average of all these absolute values is our 68”. The 
The advantage is that the mean values 8“? are calculated anyway, because 
they yield the @,, and 8, is just calculated as the last contribution. The 
absolute value of only this one difference is taken, added to the corresponding 
quantities of the foregoing steps and the mean is taken; neither are new dif- 
ferences calculated, nor are square roots involved. Since S tends to <S) 
as u > cc, it is easy to show that 


(9) lim 68S” = <[AS]>. 


But, since we do not intend to use large 2, we must relate the new quantity 
to <|AS|) or <A8?) also for finite 4. 

For this purpose we use the assumption of a Gauss distribution, since this 
allows explicit calculations. We shall calculate the expectation value of 5S” 
and express it by <AS°» under the assumption that the distribution of S, 
about <S> follows a Gauss law 


(10) AS = S—<8> 


and 


+0 


: fresras =A. 


= .@: 


à AS? 
Se ET AT TT TN DE a EL, ST N 
Vans) Pl 2485) 


(11) w(S) = 
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Then the expectation value (5 ii) is defined as 


+ © 


(12) (FY = / F(S) w(S) ds . 


— o 


We need for the following a simple consequence of (11): As is well known. 
the mean value of a sum of random variables follows again a Gauss distri- 
bution, if the random variables themselves follow such à distribution. The 
r.m.s. deviation of the mean value of S” from <S> is then, as shown in the 
derivation of (6) 


on \ a di } = 
(13) <(8°°— (8))*> = A, 8?» = = (A8*). 


The probability that S° lies between x and #-+dx, is therefore according to 
(11) and (13) given by 


da (e — <8>)? | 
14 W,(”) da = —————___ ex re 
de) DI V2x -(1/n)<AS?> | 2(1/n)<AS?»> 


We determine now the probability distribution of the difference 


(15) o, = SI — 8 


pl 


in (8). We must take care of the fact, that SN‘ is not independent of S, and 
write, therefore 


DER) 
t=1 LH 


Es 


where now Sf” is independent of S,: We put Ret 8 es Y,. The pro- 
bability that y <Y, Sy +dy, is then ae 


+ © 


vA(y) dy = ay] Waly + S,) %(8,,) ds, 


and with (14) and (11): 


bye 1 a 
(16) bly) See Se ee | 3 
div (u/(u — 1) 248% | (uf(u— 1) 2485) 


The distribution of 6, is then according to the general formula 


Vif(w)} df(æ) = v(x) dr 
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given by 


1 or 


SP | ((@= 1/4) 24485] 


i si V ((u —1)/u) 27r<A82) 


LU 


This is again a Gauss distribution with, as a glance on (11) shows: 


(18) Pe TN 
be 


A simple calculation shows now that for a Gauss distribution abont zero 


ee 
(19) <|æl> == 2 fat dae= Eve 
JE 
0 


This gives 


>= [2 2 vas, 


(1) 
CET A 


Vb 
X 


e 1 7 4 2 
GS" > => > </8" — 8,1), 
u=2 
with the result 
wa, ras id ed 
GS > = |/ÉV<A8 = ei 
(20) HAE 83 > | F 
or 
“SS 3” S D 
= i lr IENE 
VSS Tae COR TEASER i ai 
(21) bale 
A el 
À u =2 pl u=2 U 


This can now be introduced into (7). Of course, we find ourselves here before 
the same difficulty as with (7). What we actually know is not <68S, but 
only that particular 6S” which we can calculate using the actually computed 8. 
As pointed out at the beginning, this is a logical consequence of the fact that 
we do not know the distribution, but only a finite number of drawn samples. 
We could now continue the argument and proceed to calculate the distribution 
of OS” around its expectation value 58%». We would then end up with 
essentially the same situation. 
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In fact the situation is not so bad; if we now replace 


{OS PS 
by the calculated value 


Is 


then for not too small À the actual 68” lies near its expectation value. The 
error in the error estimate, which we thereby introduce, is therefore only of 
higher order. Even if we would estimate the error wrongly by a factor 2 or 
so (which is very improbable), this would not cause any serious trouble. We 
shall therefore adopt the following convention for the estimate of the error: 


(22) - = | = - ae 12 >1. 


Here 2, is a given small number fixing the accuracy. À, > 1 means 2, greater 
that at least 5. This is necessary, since otherwise the mean value over the 
actual deviations does not mean very much. Note that for À — 1 the expres- 
sions would not be defined since numerator and denominator become zero. 
The sum in the denominator, 


‘an be calculated for Z= 5, 6,..., 20, or so, and be stored in the computer. 
Then no square roots are necessary during the programme and the check of 
the accuracy by means of (22) becomes extremely simple in an automatic 
calculation. 

If, depending on the construction of the computer, it is simpler to take 
a square than to take the absolute value of a given number, then one can use 
instead of (22) the following formula, which is derived in essentially the same way: 


with the same 2, as above. The factor V7/2 and the roots in the denominator 
have disappeared. The sum in the denominator can here be calculated for 


= 
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each check by the recurrence formula, since the sum for 7—1 is already 
there from the foregoing check. 
It should” be noted again that if 


(00); 
Q 


SD. 


this means that the probability that the actual error is <<, is & 68% and 
so on (see discussion following eq. (6)). Even that is only true for large À, 
since we replaced <6S®»> by the calculated 5S”. To be sure, one can always 
check a few results by Student’s test and make 2, smaller if necessary. 

In the following flow diagram (22) and (23) are considered separately. 


Z. — Flow diagram. 


The following flow diagram has been developed in collaboration with mem- 
bers of the «Institut für praktische Mathematik » Technische Hochschule, 
Darmstadt, Germany. 

It may therefore show some features which are due to the particular pro- 
perties of the IBM 650 computer that was used for the calculation. Still, it 
is felt that this flow diagram will remain essentially the same for any type of 
computer. Therefore it was considered useful to reproduce it here, since it 
involves som little tricks. | 

A few comments: 


a) Input data 
T, No, n, A, Ae are defined in (I). 
À and 4, are defined here by eqs. (22) and (23). 
M ... M, are the masses of the particles. 
The most convenient values for À, and N, and reasonable values for &, 
depend on the total energy and on the number and masses of the particles. 
À should at least be of the order of 5. The smaller we choose N,, the 
larger À becomes and the better one can observe the convergence: see 
discussion under h). 


b) Instructions before (2) 
W,, gives the spectrum. 
(A/A)W,, will be the 0% (E, 0, ¢,) Ae (of (I), Fig. 2) in the k-th cell. In this 
programme each particle gets its own spectrum, whether there are iden- 
tical particles or not. If there are equal particles, one takes in the end 
the mean value of their spectra thus gaining statistical accuracy. It is 
convenient to normalize the spectra of the different kinds of masses and 
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to print out directly the normalized spectra together with the mean energies. 
Since the kinetic energies are as readily available in the computer as the 
total energies are, one may express the spectra as functions of either vari- 
able. (See discussion under g)). 

À, X',, D, d are quantities used during the calculation and the error estimate. 
(See below). 


Between (3) and (4) 

Here the random numbers are produced. In the present flow diagram it 
has been assumed that there is a subroutine available which produces 
(pseudo-) random numbers of three digits between 0 and 999. They are 
called ZT’ and are mapped on the interval 0 <T,<T. 


Between (4) and (8) 

Here the n —1 random numbers 0 < 7, < T are reordered according to 
magnitude and the variables which determine y (I, 18)) are prepared. 

Ff = &,°&"&3...€, is an abbreviation only. 


Between (8) and (9) (or (14) respectively): 

It is checked if perhaps one p; is larger than the sum of the other p’s. 
If that is the case all the calculations up to (14) are unnecessary, since 
according to (I, A.10) they yield zero. It is doubtful whether this check 
gives a gain in computing time, as the probability for «yes» is (5)! 
(n particles). Thus it saves only in a very few cases a long, useless cal- 
culation, but on the other hand it takes itself a time proportional to n. 


Between (9) and (12) 
the function y is calculated. 


2, is the square bracket of (I, 18). 


Z,=YoVe-m, (I, 18). 


i 


That part, which begins at (11) and feeds itself back into (10) calculates 
the sum 2, following exactly the example given in (I, Appendix). It is 
meant rather symbolically, since most computers do not have an operation 
which checks whether a number is even. There are many ways of replacing 
this by an operation which is much faster and implies essentially the same. 


Between (12) and (15): do 
The spectra are calculated for each particle separately according to (I). 
W,, is a storage matrix, which contains (for r fixed) the spectrum of par- 
ticle with number r. See discussion under b). 
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Between (15) and the end. 

The accuracy is checked. S contains now the sum of the y’s coming from 
N, samples and 2, is the sum of 2 such S. It seems desirable that the 
machine prints out S, and (1/2) Z, which allows one to draw histograms 
such as shown in Fig. 1. This visual control often gives a much better 
insight than a r.m.s. value or similar quantities. Since the convergence 
of the MC approach seems to depend on the energy, number of particles, ete., 
it is advisable to have this extra check here. In order to have a summary 
of the actual situation, also z, (or 2; respectively) (see here (22) and (23)) 
are printed out. D and d are numerator and denominator respectively 
in (22) and (23) of this paper. 

Finally, one may have chosen the desired accuracy so high, i.e. 2) So small, 
that it would never be reached in a reasonable computing time. There- 
fore the possibility is provided for finishing the calculation and printing 
out the present results, even if 2, is not < 2. 


If the square of the relative error is estimated by using the squared dif- 


ferences to the mean value, the flow diagram is changed only between 
(16) —...— (17) as follows: 


À D 
2 27 
(Ex? ‘à 
print out Z 


2 
(HEg-S) + D D 


Al ged 


c) 


Flow diagram. 


Since here the quadratic error is estimated, 4, has here to be the square 


of the 2, in the other method. 


In the other method V PEST and V xA)2 can be calculated for the in- 


teresting A-values and stored as a small table. Then the lengthy square roots 
are eliminated from the program. 


8. — Practical experiences. 


primary (laboratory) energy. The cases 2N +7 and 2N +27 could be checked 


At present a programme is running for nucleon nucleon collisions at 25 GeV 


with the formulae given by BLOCK (4) and showed good agreement. As the 
phase space integral can be solved rigorously for massless particles, we used 


(4) M. M. Brock: Phys. Rev., 101, 796 (1956). 
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a few such cases for various other checks. All cases in which a check was 
possible showed that the actual error was of the order of the predicted one 
and in fact in most cases it was even smaller (compare the predictions in the 
text between Eq. (6) and (7)). 

We shall now give one typical example in some detail and summarize the 
others in a table. 

Example: We took the following data: 


Po p=, = mM, = 0; = T= 0.9916 (5) No 10s “Aas 


The total number of samples was therefore 370. The prescribed accuracy 
2% was taken to be zero, so that the machine continued to calculate, until it 
was stopped. It seems most convenient to 
represent everything in the form of histo- 
grams as in Fig. 1 and 2. 

Fig. 2 shows the contributions #,, each 
coming from N,—10 samples We have 
drawn in the mean value 8%— (1/4) 
(A= 37) and the estimated error limits as 
well as the exact value, which in this case 
lies inside the estimated limits. 

Fig. 3 shows (1/u)2’, for u = 2, 3,...,37 again together with the exact value. 
Here the MC value approaches the exact one from above. In other cases it 
came from below and in still others it oscillated. 


mean vate} £, 5% 
exact value from 
analytical calculation 

TN ILN/7) 44 ie : 


VAZZA waa [774 LA [4722222} VLA LZ LLL 


ey 


Big. 1. 


Sy (arbitrary units) 


estimated error: 7.7°o 
actual ‘error : 6.2% 


Fig. 2. 


(5) Units are i —c=1; the energy unit is the nucleonie mass: Jf = 1. 
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Fig. 4 shows the estimated error in percent. One sees here that it takes 
some time before this a posteriori error estimate begins to have a meaning. 
In the beginning it fluctuates very much and only for 2 greater than about 


a=W} 
9 Su 


Z, (arbitrary units) 


exact value 


Fig. 3. 


15 does it become reasonably smooth. This shows that in this case A, — 15 
would have been reasonable (A, is that number of sets of N, samples, beyond 
which the machine would take the error estimate seriously, compare it with 


(2), in percent 


0 Ò 10 15 20 25 30 3504 
Fig. 4. 


the prescribed error 2, and stop as soon as the estimate is less than that). 
Of course, A, is roughly inversely proportional to N, and here we took N, 
small on purpose, so as to obtain an illustrative example. For comparison a 


curve const/Vu has been drawn in. The constant was adjusted such that this 


curve passed through (9/0), 


Le. — 


{ La in, i i pa di ap; 
nl Tx ! 
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Fig. 5 shows the MC spectrum as a histogram together with the exact 
analytical expression (full curve), which was normalized to the same area. 
It is interesting to compare this 
with not only the histogram, but 
with the full information, which 


2 (€) Ae (arbitrary units) 


one could obtain from it: the 20} 

dotted curve shows a fit by a | 

polynomial, the coefficients of 15f analytical curve 
which were calculated by a least least squares fit. 
Squares method. In this calculation ah 

the normalization was accounted 

for by a supplementary condition. *f 

It is this dotted curve which one 


kinetic energy — 
would consider as the calculated 
spectrum and it seems to come 
out quite well. It should be noted 
that it would come out even better 
if the spectrum were smooth at the upper end. As one sees from the ana- 
lytical curve, it has there the character of a step function, which makes it a 
bit difficult for the interpolation polynomial to reproduce it very well. For 
particles with mass and for higher particle numbers nothing like that will 
happen. 


Further results for checking the method: Before application, the method 
was checked in a few further cases for which exact values were known. The 
calculations were done with only a few samples in the first two examples. 
The results are given in Table I. 


TABLE I. 
: CM kinetic È, ; ARR 
Particles energy Oxo Oexact J = 
3 predicted actual 
6 particles 0.991 6 1.29: 10-° 1.65: 107$ +23 % + 27.9% 
m 0 
6 particles 5.57 ERO 5.4 «104 + 23.5% + 5.9% 
m= 0 
3 particles 0.9916 15.72 14.67 + 7.6% — 7.2% 
My, = Ma = 1 
Mg — 0.15 


43 - Supplemento al Nuovo Cimento. 


Pa. si a 
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4, — Estimates of computing time. 


As already mentioned, the calculations were performed on a IBM 650 
computer. Our computing times could be translated into those for other com- 
puters by a suitable factor which may be inferred from well known data on 
such machines. 

Fig. 6 shows the computing time for 50 samples as a function of the total 
number of particles, n. The dotted part of the curve is an extrapolation based 
on estimates. It is seen that this function increases rapidly. It should be 
noted that: 


i) No attempt has been made in our case to achieve « optimal program- 
ming », 


ii) there are computers available, which are much faster than the IBM 650. 


h/50 samples 


» | (/BM 650) 
- 
RA 
10° #° 10° 
Le 
+, 
Pd 
Di 
# 

10 of 10 


estimated ed 


n 


number of particles number of particles 


10 15 n 5 10 n 15 


Fig. 6. Big. 07: 


Both facts together lead to the guess that on the fastest computers this time 
‘an be reduced by a factor of about 100. 

We have relatively less experience of the number of samples needed for 
a certain accuracy, since only a few cases were carried through until a high 
accuracy was achieved. Fig. 4, however, shows that the accuracy goes roughly 
with 1/V4. 

In the following Fig. 7 we show the number of samples versus the number n 
of particles which gives an accuracy below —8%. Some of the points were 
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found by extrapolating the trend of the error by means of const/V7 as shown 
in Fig. 4. The points partly represent mean values over several different cal- 
culations. The straight line is drawn tentatively. The present information 
does not allow a better determina- 
tion. (For n=5 the error has ob- { tm) for 28% accuracy 4 
viously been underestimated). (18M 650) vs 
It seems that in cases where the / 
kinetic energy T is small compared / 
to the total mass, the convergence / 
does not depend very much on the Le / 
number of particles. In contrast if T / 
is larger than M, the number of ne- / 
cessary Samples increases rapidly with 
the number of particles. This can 
be made plausible by a discussion 


; ’ 10' + 

of the formulae for y (I, 18). Taking 

two different samples and looking at 

what effect the differences of the x 
10 


samples have on y, it is found that y ° 
is more sensitive for differences of 


samples when 7 > M than when the number of particles 

4 en EI + mi dino 1 n si +__ 
mass is large, since in the latter 9 a > SOE 
case the masses determine the value Vig. 8. 


of y more than the kinetic energies 
do. Sensitivity with respect to differences between samples means slow con- 


vergence. It seems that Fig. 6 shows a pessimistic estimate, since it is drawn 


using mostly results with cases where 7 was larger than M. 

The total computing time necessary for reaching an accuracy of about 8% 
is then given by the product of the curves of Figs. 6 and 7. The result is 
shown in Fig. 8. 

This figure should not be taken too seriously because: 

i) it is based partly on very rough estimates, 

ii) in most applications for higher numbers of created particles less ac- 
curacy is necessary, since the total phase space volume is already so 
small that it contributes very little to everything which one normally 
calculates, as for instance the mean number of created particles, total 
spectra, mean kinetic energies, etc. 


Assuming that optimal programming on à faster computer leads to perhaps 
a reduction by a factor 100, then the method is expected to break down for 
13 particles if 8% accuracy is demanded throughout. If, however, the higher 
particle numbers are not very important, so that (20 +30) % accuracy is enough, 


ness. 9 * è) OU DT TE 
i 3 Pi 9 \ x 0 . 3 : 4 i QE 
4 4 d ‘ e “ Ce 
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then one may use the method up to 14 or perhaps 15 particles. At energies 
where ¢.g. in a nucleon-nucleon collision 13 mesons are produced with non- 
vanishing probability, the whole Fermi theory becomes rather doubtful on 
one hand and pure thermodynamical approaches begin to yield reasonable 
results on the other hand. Thus the MC method seems to cover essentially 
the most interesting region. 


5. — Comparison with the usual approximation. 


The usual procedure for calculating a phase space integral for, say, N nu- 
cleons and » mesons has been to consider the nucleons as very heavy and the 
mesons as particles without mass. Then one puts | 


|? for mesons 
(24) E=Vp+m x 


9 


3 for nucleons 
2m 


| m + 


and one can now calculate exactly on either assumption the integral 


(25) ox(E, P) =| Ô(E— > E;) 0(P — > p,) dp, ... dp 


by introducing the Fourier representations of the two à-functions. The total 


o* is then given by the convolution 


(26) bio og) | où(E —e, |P—p|) ox(e, p)dedp , 
{ « Monte Carlo which in turn is quite involved but can be 


ximated : 3 : 
Renee calculated by various approximations, e.g. 


the method of steepest descent or the 


N+N—-N+Nenn 


5 25 GeV È . È , 
l use of tabulated functions. As far as we 
ye know, only BELEN'KIyJ et al. ($) have con- 
| sidered the convolution (26), whereas other 
î authors tacitly assume that each system of 
2 mesons and nucleons, separately has total 

momentum zero. This would be a fair 
Eee rons = approximation only for the case that any 
o—_}+—_—— + of these systems contains many particles. 


Since this is not true, the result of such 
calculations will be even worse than with 


(9) S. BELEN'K1J el al.: Usp. Piz. Nauk, 62, 1 (1957). 


L 
\ 
, 
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the use of (26). As an illustration, Fig. 9 shows the factor between o* calculated 
using (24)-(26) and o* calculated by our method. We took 25 GeV primary 
energy. 


* OK OK 
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